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NOVEL PREADIPOCYTE FACTOR-1 -LIKE POLYPEPTIDES 

FIELD OF THE INVENTION 

The present invention relates to nucleic acid sequences identified in human genome as 
5 encoding for novel polypeptides, more specifically for Preadipocyte factor-1-like 
polypeptides. 

All publications, patents and patent applications cited herein are incorporated in full by 
reference. 

BACKGROUND OF THE INVENTION 

10 Many novel polypeptides have been already identified by applying strict homology 
criteria to known polypeptides of the same family. However, since the actual content in 
polypeptide-encoding sequences in the human genome for Preadipocyte factor -1 -like 
polypeptides (and for any other protein family) is still unknown, the possibility still exists 
to identify DNA sequence encoding polypeptide having Preadipocyte factor-1-like 

15 polypeptide activities by applying alternative and less strict homology/structural criteria 
to the totality of Open Reading Frames (ORFs, that is, genomic sequences containing 
consecutive triplets of nucleotides coding for amino acids, not interrupted by a 
termination codon and potentially translatable in a polypeptide) present in the human 
genome. 

20 The ability for cells to make and secrete extracellular proteins is central to many 
biological processes. Enzymes, growth factors, extracellular matrix proteins and 
signalling molecules are all secreted by cells. This is through fusion of a secretory 
vesicle with the plasma membrane. In most cases, but not all, proteins are directed to 
the endoplasmic reticulum and into secretory vesicles by a signal peptide. Signal 

25 peptides are cis-acting sequences that affect the transport of polypeptide chains from 
the cytoplasm to a membrane bound compartment such as a secretory vesicle. 
Polypeptides that are targeted to the secretory vesicles are either secreted into the 
extracellular matrix or are retained in the plasma membrane. The polypeptides that are 
retained in the plasma membrane will have one or more transmembrane domains. 

30 Examples of secreted proteins that play a ce ntral role in the functioning of a cell are 
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cytokines, hormones, extracellular matrix proteins (adhesion molecules), proteases, 
and growth and differentiation factors. 

Preadipocyte factor-1 (Pref-1), also known as fetal antigen 1 (FA1), deltalike (dlk), or 
stromal cell-derived protein-1 (SCP-1) is an epidermal growth factor (EGF) repeat 
5 domain-containing transmembrane protein with an anti-adipogenic function (Smas et al 
(1999) J Biol. Chem. 274,12632-12641). 

This protein is expressed in preadipocytes, the cells that upon receiving specific signals 
differentiate into mature adipocytes, but is totally absent in the mature adipocytes, the 
specialised cells that function to accumulate fat as stored energy. It has been found 
10 that addition of soluble Preadipocyte factor-1 to differentiating 3T3-L1 preadipocyte 
cultures abolishes their differentiation to adipocytes. 

It is thought that there are a number of alternately spliced variants for Preadipocyte 
factor-1. Therefore, it is likely that Preadipocyte factor-1 acts as both an ajuxtacrine 
and a paracrine regulator of adipogenesis. Soluble Preadipocyte factor-1 is detectable 
15 in the circulation and is the result of post-translational modification. The EGF repeat 
domains of Preadipocyte factor-1 mediate cell growth and differentiation in a variety of 
biological settings. It is likely that these domain are involved in binding to putative 
Preadipocyte factor-1 receptor (s) on cells in order to maintain the preadipose 
phenotype. 

20 In developing embryos, Preadipocyte factor-1 is expressed in numerous tissues 
including pituitary, liver, lung, tongue and vertebrae, indicating that the expression of 
Preadipocyte factor-1 may be involved in preventing terminal differentiation and 
allowing cell proliferation. In adults, Preadipocyte factor-1 expression is critical for the 
adrenal cortical zona glomerulosa differentiation (Raza et al (1998) Endocr. Res. 

25 24,977-81). Failure of the zona glomerulosa to differentiate can lead to increased 
resorption of sodium, increased resorption of water, with consequent expansion of 
extracellular fluid volume and increased renal excretion of potassium. 
Thus, dysregulation of Preadipocyte factor-1 expression may lead to various conditions 
such as obesity, organomegaly, diabetes (including insulin resistance, 

30 hyperinsulinemia, hyperglycemia, hypertriglyceridemia), adrenocortical dysfunction, 
hypertrophy of cardiac and skeletal muscle, lipodystrophy, and immune system 
disorders including autoimmune diseases and immunodeficiencies, developmental 
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defects. cancer, autoimmune thyroid diseases and related disorders such as 
opthalmopathies and other pathological conditions. Particularly for this reason, 
Preadipocyte factor 1, and molecules related thereto are of significant interest in 
increasing understanding of the underlying pathways that lead to the various disease 
5 states in which these proteins are implicated, and in developing more effective gene or 
drug therapies to treat these disorders. 

SUMMARY OF THE INVENTION 

The invention is based upon the identification of an Open Reading Frame (ORF) in the 
human genome encoding a novel Preadipocyte factor-1-like polypeptide. This 
10 polypeptide will be referred to herein as the SCS0009 polypeptide. Based on the 
SCS0009 polypeptide, five other splice variants were identified: SCS0009-SV1. 
SCS0009-SV2, SCS0009-SV3, SCS0009-SV4 and SCS0009-SV5, of which SCS0009- 
SV3 has not been described before. 

Accordingly, the invention provides isolated SCS0009 polypeptides having the amino 
15 acid sequence given by SEQ ID NO: 2, SEQ ID NO: 8, their mature forms, their 
histidine tagged forms, variants, and fragments, as polypeptides having th e activity of 
Preadipocyte fector-1 -like polypeptides. The invention includes also the nucleic acids 
encoding them, vectors containing such nucleic acids, and cell containing these vectors 
or nucleic acids, as well as other related reagents such as fusion proteins, ligands, and 
20 antagonists. 

The invention provides methods for identifying and making these molecules, for 
preparing pharmaceutical compositions containing them, and for using them in the 
diagnosis, prevention and treatment of diseases. 



25 



30 



BRIEF DESCRIPTION OF THE FIGURES 
Figure 1: alignment of the SCS0009 ORF with known related polypeptide 
sequences. 

Figure 2: Clustal W alignment of predicted amino acid sequences of SCS0009 
prediction and splice variants SV3, SV4 and SV5. 

Figure 3: Clustal W alignment of precited amino acid sequences of SCS0009 with 
SV1 and SV2. 

Figure 4: Nucleotide sequence of SCS0009 prediction with translation. 
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Figure 5: Nucleotide sequence with translation of cDNA Insert in image clone 
5478078 (SCS0009-SV3). 

Figure 6 : Alignment of predicted amino acid sequence of SCS0009 with 

SCS0009-SV3. 
5 Figure 7: Map of expression vector pEAK12d. 

Figure 8: Map of Expression vector pDEST12.2. 

Figure 9: Map of pDONR 221. 

Figure 10: Map of pENTR-SCS0009SV3-6HlS. 

Figure 11: Map of pEAK12d-SCS0009SV3-6HIS. 
10 Figure 12: Map of pDEST12.2-SCS0009SV3-6HlS. 

Figure 13: Nucleotide sequence of SCS0009 prediction with translation. 

Figure 14: Nucleotide sequence with translation of cDNA insert in image clone 

3349698 (SCS0009 -SV4). 

Figure 1 5: Alignment of predicted amino acid sequence of SCS0009 with 
15 SCS0009-SV4. 

Figure 16: Map of expression vector pEAK1 2d. 

Figure 17: Map of Expression vector pDEST12.2. 

Figure 18: Map of pDONR 221. 

Figure 19: Map of pENTR-SCS0009SV4-6HIS. 
20 Figure 20: Map of pEAK12d-SCS0009SV4-6HIS. 

Figure 21: Map of pDEST12.2-SCS0009SV4-6HIS. 

Figure 22: Nucleotide sequence of SCS0009 prediction with translation. 

Figure 23: Nucleotide sequence with translation of SCS0009-SV5 PCR product 

indicating the positions of the SCS0009-AP1, -AP2, -AP3 and -AP4 primers used 
25 to generate the SCS0009 sequence. 

Figure 24: Nucleotide sequence and translation of cloned SCS0009 ORF. 

Figure 25: Map of pCR-Bluntll-TOPO-SCS0009. 

Figure 26: Map of pDONR 221 . 

Figure 27: Map of expression vector pEAK12d. 
30 Figure 28: Map of Expression vector pDEST12.2. 

Figure 29: Map of pENTR-SCS0009-6HIS. 

Figure 30: Map of pEAK12d-SCS0009-6HIS. 

Figure 31: Map of pDEST12.2-SCS0009-6HIS. 

Figure 32. SMART Domains alignment of the SCS0009 polypeptides. 
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DETAILED DESCRIPTION OF THE INVENTION 

According to a first aspect of the present invention there is provided an isolated 
polypeptide having Preadipocyte factor-1-like activity selected from the group 
consisting of: 

5 a) the amino acid sequences as recited in SEQ ID NO: 2 or SEQ ID NO: 8; 

b) the mature form of the polypeptides whose sequence are recited in SEQ ID 
NO: 2 (SEQ ID NO:3) or SEQ ID NO: 8 (SEQ ID NO:9); 

c) the histidine tagged form of the polypeptides whose sequence are recited in 
SEQ ID NO: 2 (SEQ ID NO:4) or SEQ ID NO: 8 (SEQ ID NO:10); 

10 d) a variant of the amino acid sequences recited in SEQ ID NO: 2 or SEQ ID 

NO: 8, wherein any amino acid specified in the chosen sequence is non- 
conseivatively substituted, provided that no more than 15% of the amino acid 
residues in the sequence are so changed; 
e) an active fragment, precursor, salt, or derivative of the amino acid sequences 
15 given in a) to d). 

The novel polypeptide described herein was identified on the basis of a consensus 
sequence for human Preadipocyte factor-1 -like polypeptides in which the number and 
the positioning of selected amino acids are defined for a protein sequence having a 
length comparable to known Preadipocyte factor-1 -like polypeptides. 
20 The totality of amino acid sequences obtained by translating the known ORFs in the 
human genome were challenged using this consensus sequence, and the positive hits 
were further screened for the presence of predicted specific structural and functional 
"signatures" that are distinctive of a polypeptide of this nature, and finally selected by 
comparing sequence features with known Preadipocyte factor-1 -like polypeptides. 
25 Therefore, the novel polypeptides of the invention can be predicted to have 
Preadipocyte factor-1 -like activities. 

The terms "active" and "activity" refer to the Preadipocyte factor-1 -like properties 
predicted for the Preadipocyte factor-1 -like polypeptide whose amino acid sequence is 
presented in SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 8, SEQ ID NO: 
30 9 or SEQ ID NO: 10 in the present application. These properties include the ability to 
prevent terminal differentiation of preadipocytes and allow cell proliferation. 
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In a second aspect, the invention provides a purified nucleic acid molecule which 
encodes a polypeptide of the first aspect of the invention. Preferably, the purified 
nucleic acid molecule comprises or consists of the nucleic acid sequences as recited in 
SEQ ID NO:1 or SEQ ID IMO:7 (encoding the Preadipocyte factor -1-like polypeptides 
5 whose amino acid sequences are recited in SEQ ID NO:2 or SEQ ID NO:8). particularly 
the coding sequence which starts at nucleotide 122 and ends at 1180 in SEQ ID NO:1 
or starts at nucleotide 1 and ends at 1 131 in SEQ ID IMO:8. 

In a third aspect, the invention provides a purified nucleic acid molecule which 
hydridizes under high stringency conditions with a nucleic acid molecule of the second 
10 aspect of the invention. 

In a fourth aspect, the invention provides a vector, such as an expression vector, that 
contains a nucleic acid molecule of the second or third aspect of the invention. 
In a fifth aspect, the invention provides a host cell transformed with a vector of the 
fourth aspect of the invention. 
15 In a sixth aspect, the invention provides a ligand which binds specifically to, and which 
preferably inhibits the metalloprotease activity of a polypeptide of the first aspect of the 
invention. Ligands to a polypeptide according to the invention may come in various 
forms, including natural or modified substrates, enzymes, receptors, small organic 
molecules such as small natural or synthetic organic molecules of up to 2000Da. 
preferably 800Da or less, peptidomimetics, inorganic molecules, peptides, 
polypeptides, antibodies, structural or functional mimetics of the aforementioned. 
In a seventh aspect, the invention provides a compound that is effective to alter the 
expression of a natural gene which encodes a polypeptide of the first aspect of the 
invention or to regulate the activity of a polypeptide of the first aspect of the invention. 
A compound of the seventh aspect of the invention may either increa se (agonise) or 
decrease (antagonise) the level of expression of the gene or the activity of the 
polypeptide. Importantly, the identification of the function of the Preadipocyte factor -1- 
like polypeptide of the invention allows for the design of screening methods capable of 
identifying compounds that are effective in the treatment and/or diagnosis of disease. 
30 In an eighth aspect, the invention provides a polypeptide of the first aspect of the 
invention, or a nucleic acid molecule of the second or third aspect of the invention, or a 
vector of the fourth aspect of the invention, or a host cell of the fifth aspect of the 
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invention, or a ligand of the sixth aspect of the invention, or a compound of the seventh 
aspect of the invention, for use in therapy or diagnosis. These molecules may also be 
used in the manufacture of a medicament for the treatment of obesity, organomegaly, 
diabetes (including insulin resistance, hyperinsulinemia, hyperglycemia, 
hypertriglyceridemia), adrenocortical dysfunction, hypertrophy of cardiac and skeletal 
muscle, lipodystrophy, and immune system disorders including autoimmune diseases 
and immunodeficiencies, developmental defects, cancer, autoimmune thyroid diseases 
and related disorders such as opthalmopathies and other pathological conditions. 
In a ninth aspect, the invention provides a method of diagnosing a disease in a patient, 
comprising assessing the level of expression of a natural gene encoding a polypeptide 
of the first aspect of the invention or the activity of a polypeptide of the first aspect of 
the invention in tissue from said patient and comparing said level of expression or 
activity to a control level, wherein a level that is different to said control level is 
indicative of disease. Such a method will preferably be carried out in vitro. Similar 
15 methods may be used for monitoring the therapeutic treatment of disease in a patient, 
wherein altering the level of expression or activity of a polypeptide or nucleic acid 
molecule over the period of time towards a control level is indicative of regression of 
disease. 

A preferred method for detecting polypeptides of the first aspect of the invention 
20 comprises the steps of: (a) contacting a ligand. such as an antibody, of the sixth aspect 
of the invention with a biological sample under conditions suitable for the formation of a 
ligand-polypeptide complex; and (b) detecting said complex. 

A number of different such methods according to the ninth aspect of the invention exist, 
as the skilled reader will be aware, such as methods of nucleic acid hybridization with 
25 short probes, point mutation analysis, polymerase chain reaction (PCR) amplification 
and methods using antibodies to detect aberrant protein levels. Similar methods may 
be used oh a short or long term basis to allow therapeutic treatment of a disease to be 
monitored in a patient. The invention also provides kits that are useful in these methods 
for diagnosing disease. 

30 In a tenth aspect, the invention provides for the use of a polypeptide of the first aspect 
of the invention as a Preadipocyte factor-1-like protein. Suitable uses include use as a 
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secreted glycoprotein, in particular in the context of preventing terminal differentiation 
of preadipocytes and allowing cell proliferation. 

In an eleventh aspect, the invention provides a pharmaceutical composition comprising 
a polypeptide of the first aspect of the invention, or a nucleic acid molecule of the 
5 second or third aspect of the invention, or a vector of the fourth aspect of the invention, 
or a host cell of the fifth aspect of the invention, or a ligand of the sixth aspect of the 
invention, or a compound of the seventh aspect of the invention, in conjunction with a 
pharmaceutically-acceptable carrier. 

In a twelfth aspect, the present invention provides a polypeptide of the first aspect of 
10 the invention, or a nucleic acid molecule of the second or third aspect of the invention, 
or a vector of the fourth aspect of the invention, or a host cell of the fifth aspect of the 
invention, or a ligand of the sixth aspect of the invention, or a compound of the seventh 
aspect of the invention, for use in the manufacture of a medicament for the diagnosis or 
treatment of a disease, such as for the treatment of prevention of obesity, 
15 organomegaly, diabetes (including insulin resistance, hyperinsuli nemia, hyperglycemia, 
hypertriglyceridemia), adrenocortical dysfunction, hypertrophy of cardiac and skeletal 
muscle, lipodystrophy, and immune system disorders including autoimmune diseases 
and immunodeficiencies, developmental defects, cancer, autoimmune thyroid diseases 
and related disorders such as opthalmopathies and other pathological conditions. 

20 In a thirteenth aspect, the invention provides a method of treating a disease in a patient 
comprising administering to the patient a polypeptide of the first aspect of the invention, 
or a nucleic acid molecule of the second or third aspect of the invention, or a vector of 
the fourth aspect of the invention, or a host cell of the fifth aspect of the invention, or a 
ligand of the sixth aspect of the invention, or a compound of the seventh aspect of the 

25 invention. 

For diseases in which the expression of a natural gene encoding a polypeptide of the 
first aspect of the invention, or in which the activity of a polypeptide of the first aspect of 
the invention, is lower in a diseased patient when compared to the level of expression 
or activity in a healthy patient, the polypeptide, nucleic acid molecule, ligand or 
30 compound administered to the patient should be an agonist Conversely, for diseases 
in which the expression of the natural gene or activity of the polypeptide is higher in a 
diseased patient when compared to the level of expression or activity in a healthy 
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patient, the polypeptide, nucleic acid molecule, ligand or compound administered to the 
patient should be an antagonist. Examples of such antagonists include antisense 
nucleic acid molecules, ribozymes and ligands, such as antibodies. 
In a fourteenth aspect, the invention provides transgenic or knockout non -human 
5 animals that have been transformed to express higher, lower or absent levels of a 
polypeptide of the first aspect of the invention. Such transgenic animals are very useful 
models for the study of disease and may also be used in screening regimes for the 
identification of compounds that are effective in the treatment or diagnosis of such a 
disease. 

10 A summary of standard techniques and procedures which may be employed in order to 
utilise the invention is given below. It will be understood that this invention is not limited 
to the particular methodology, protocols, cell lines, vectors and reagents described. It is 
also to be understood that the terminology used herein is for the purpose of describing 
particular embodiments only and it is not intended that this terminology should limit the 

15 scope of the present invention. The extent of the invention is limited only by the terms 
of the appended claims. 

Standard abbreviations for nucleotides and amino acids are used in this specification. 
The practice of the present invention will employ, unless otherwise indicated, 
conventional techniques of molecular biology, microbiology, recombinant DNA 
20 technology and immunology, which are within the skill of the those working in the art 
Such techniques are explained fully in the literature. Examples of particularly suitable 
texts for consultation include the following: Sambrook Molecular Cloning; A Laboratory 
Manual, Second Edition (1989); DNA Cloning, Volumes I and II (D.N Glover ed. 1985); 
Oligonucleotide Synthesis (M.J. Gait ed. 1984); Nucleic Acid Hybridization (B.D. 

25 Hames & S.J. Higgins eds. 1984); Transcription and Translation (B.D. Hames & S.J. 
Higgins eds. 1984); Animal Cell Culture (R.I. Freshney ed. 1986); Immobilized Cells 
and Enzymes (IRL Press, 1986); B. Perbal, A Practical Guide to Molecular Cloning 
(1984); the Methods in Enzymology series (Academic Press, Inc.), especially volumes 
154 & 155; Gene Transfer Vectors for Mammalian Cells (J.H. Miller and MP. Calos 

30 eds. 1987, Cold Spring Harbor Laboratory); Immunochemical Methods, in Cell and 
Molecular Biology (Mayer and Walker, eds. 1987, Academic Press, London); Scopes, 
(1987) Protein Purification: Principles and Practice, Second Edition (Springer Verlag, 
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N.Y.); and Handbook of Experimental Immunology, Volumes I -IV (D.M. Weir and C. C. 
Blackwell eds. 1986). 

The first aspect of the invention includes variants of the amino acid sequence recited in 
SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4. SEQ ID NO: 8, SEQ ID NO: 9 or SEQ ID 

5 NO: 1 0 wherein any amino acid specified in the chosen sequence is non -conservatively 
substituted, provided that no more than 15% of the amino acid residues in the 
sequence are so changed. Protein sequences having the indicated number of non - 
conservative substitutions can be identified using commonly available bioinformatic 
tools (Mulder NJ and Apweiler R, 2002; Rehm BH, 2001). 

10 In addition to such sequences, a series of polypeptides forms part of the disclosure of 
the invention. Being Preadipocyte factor-1-like polypeptides known to go through 
maturation processes including the proteolytic removal of N -terminal sequences (by 
signal peptidases and other proteolytic enzymes), the present application also claims 
the mature form of the polypeptide whose sequence is recited in SEQ ID NO: 2 (SEQ 

15 ID NO: 4) or SEQ ID NO: 8 (SEQ ID NO: 9). Mature forms are inten ded to include any 
polypeptide showing Preadipocyte factor-1-like activity and resulting from in vivo (by 
the expressing cells or animals) or in vitro (by modifying the purified polypeptides with 
specific enzymes) post-translational maturation processes. Other alternative mature 
forms can also result from the addition of chemical groups such as sugars or 

20 phosphates. The present application also claims the histidine tagged forms of the 
polypeptides whose sequences are recited in SEQ ID NO: 2 (SEQ ID NO:4) or SEQ ID 
NO:8(SEQIDNO:10). 

Other claimed polypeptides are the active variants of the amino acid sequences given 
by SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4. SEQ ID NO: 8, SEQ ID NO: 9 or SEQ 
25 ID NO: 10 wherein any amino acid specified in the chosen sequence is non- 
conservatively substituted, provided that no more than 15%, preferably no more that 
10%. 5%. 3%. or 1%, of the amino acid residues in the sequence are so changed. The 
indicated percentage has to be measured over the novel amino acid sequences 
disclosed. 

30 In accordance with the present invention, any substitution should be preferably a 
"conservative" or "safe" substitution, which is commonly defined a substitution 
introducing an amino acids having sufficiently similar chemical properties (eg a basic, 
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positively charged amino acid should be replaced by another basic, positively charged 
amino acid), in order to preserve the structure and the biological function of the 
molecule. 

The literature provide many models on which the selection of conservative amino acids 
substitutions can be performed on the basis of statistical and physico-chemical studies 
on the sequence and/or the structure of proteins (Rogov SI and Nekrasov AN, 2001). 
Protein design experiments have shown that the use of specific subsets of amino acids 
can produce foldable and active proteins, helping in the classification of amino acid 
"synonymous" substitutions which can be more easily accommodated in protein 
structure, and which can be used to detect functional and structural homologs and 
paralogs (Murphy LR ef a/., 2000). The groups of synonymous amino acids and the 
groups of more preferred synonymous amino acids are shown in Table I. 
Active variants having comparable, or even improved, activity with respect of 
corresponding Preadipocyte factor-1-like polypeptides may result from conventional 
15 mutagenesis technique of the encoding DNA. from combinatorial technologies at the 
level of encoding DNA sequence (such as DNA shuffling, phage display/selection), or 
from computer-aided design studies, followed by the validation for the desired activities 
as described in the prior art. 

Specific, non -conservative mutations can be also introduced in the polypeptides of the 
20 invention with different purposes. Mutations reducing the affinity of the Preadipocyte 
factor-1-like polypeptide may increase its ability to be reused and recycled, potentially 
increasing its therapeutic potency (Robinson CR, 2002). Immunogenic epitopes 
eventually present in the polypeptides of the invention can be exploited for develo ping 
vaccines (Stevanovic S. 2002), or eliminated by modifying their sequence following 
25 known methods for selecting mutations for increasing protein stability, and correcting 
them (van den Burg B and Eijsink V, 2002; WO 02/05146, WO 00/34317, WO 
98/52976). 

Further alternative polypeptides of the invention are active fragments, precursors, salts, 
or functionally-equivalent derivatives of the amino acid sequences described above. 
30 Fragments should present deletions of terminal or internal amino acids not alte ring their 
function, and should involve generally a few amino acids, e.g., under ten, and 
preferably under three, without removing or displacing amino acids which are critical to 
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the functional conformation of the proteins. Small fragments may form an antigenic 
determinant 

The "precursors" are compounds which can be converted into the compounds of 
present invention by metabolic and enzymatic processing prior or after the 
5 administration to the cells or to the body. 

The term "salts" herein refers to both salts of carboxyl groups and to acid addition salts 
of amino groups of the polypeptides of the present invention. Salts of a carboxyl group 
may be formed by means known in the art and include inorganic salts, for example, 
sodium, calcium, ammonium, ferric or zinc salts, and the like, and salts with organic 
10 bases as those formed, for example, with amines, such as triethanolamine, arginine or 
lysine, piperidine, procaine and the like. Acid addition salts include, for example, salts 
with mineral acids such as, for example, hydrochloric acid or sulfuric acid, and salts 
with organic acids such as, for example, acetic acid or oxalic acid. Any of such salts 
should have substantially similar activity to the peptides and polypeptides of the 
15 invention or their analogs. 

The term "derivatives" as herein used refers to derivatives which can be prepared from 
the functional groups present on the lateral chains of the amino acid moieties or on the 
amino- or carboxy-terminal groups according to known methods. Such molecules can 
result also from other modifications which do not normally alter primary sequence, for 
20 example in vivo or in vitro chemical derivativization of polypeptides (acetylation or 
carboxylation), those made by modifying the pattern of phosphorylation (introduction of 
phosphotyrosine, phosphoserine, or phosphothreonine residues) or glycosylate (by 
exposing the polypeptide to mammalian glycosylating enzymes) of a peptide during its 
synthesis and processing or in further processing steps. Alternatively, derivatives may 
25 include esters or aliphatic amides of the carboxyl -groups and N-acyl derivatives of free 
amino groups or O-acyl derivatives of free hydroxyl-groups and are formed with acyl- 
groups as for example alcanoyl- or aryl-groups. 

The generation of the derivatives may involve a site-directed modification of an 
appropriate residue, in an internal or terminal position. The residues used for 
30 attachment should they have a side-chain amenable for polymer attachment (i.e., the 
side chain of an amino acid bearing a functional group, e.g., lysine, aspartic acid, 
glutamic acid, cysteine, histidine, etc.). Alternatively, a residue having a side chain 
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amenable for polymer attachment can replace an amino acid of the polypeptide, or can 
be added in an internal or terminal position of the polypeptide. Also, the side chains of 
the genetically encoded amino acids can be chemically modified for polymer 
attachment, or unnatural amino acids with appropriate side chain functional groups can 

5 be employed. The preferred method of attachment employs a combination of peptide 
synthesis and chemical ligation. Advantageously, the attachment of a water-soluble 
polymer will be through a biodegradable linker, especially at the amino -terminal region 
of a protein. Such modification acts to provide the protein in a precursor (or "pro -drug") 
form, that, upon degradation of the linker releases the protein without polymer 

10 modification. 

Polymer attachment may be not only to the side chain of the amino acid naturally 
occurring in a specific position of the antagonist or to the side chain of a natural or 
unnatural amino acid that replaces the amino acid naturally occurring in a specific 
position of the antagonist, but also to a carbohydrate or other moiety that is attached to 
the side chain of the amino acid at the target position. Rare or unnatural amino acids 
can be also introduced by expressing the protein in specifically engineered bacterial 
strains (Bock A, 2001). 

All the above indicated variants can be natural, being identified in organisms other tha n 
humans, or artificial, being prepared by chemical synthesis, by site -directed 
20 mutagenesis techniques, or any other known technique suitable thereof, which provide 
a finite set of substantially corresponding mutated or shortened peptides or 
polypeptides which can be routinely obtained and tested by one of ordinary skill in the 
art using the teachings presented in the prior art. 

The novel amino acid sequences disclosed in the present patent application can be 
25 used to provide different kind of reagents and molecules. Examples of these 
compounds are binding proteins or antibodies that can be identified using their full 
sequence or specific fragments, such as antigenic determinants. Peptide libraries can 
be used in known methods (Tribbick G, 2002) for screening and characterizing 
antibodies or other proteins binding the claimed amino acid sequences, and for 
30 identifying alternative forms of the polypeptides of the invention having similar binding 
properties. 
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The present patent application discloses also fusion proteins comprising any of the 
polypeptides described above. These polypeptides should contain protein sequence 
heterologous to the one disclosed in the present patent application, without significantly 
impairing the Preadipocyte factor-1-like activity of the polypeptide and possibly 

5 providing additional properties. Examples of such properties are an easier purification 
procedure, a longer lasting half-life in body fluids, an additional binding moiety, the 
maturation by means of an endoproteolytic digestion, or extracellular localization. This 
latter feature is of particular importance for defining a specific group of fusion or 
chimeric proteins included in the above definition since it allows the claimed molecules 

10 to be localized in the space where not only isolation and purification of these 
polypeptides is facilitated, but also where generally Preadipocyte factor -1 -like 
polypeptides and their receptor interact 

Design of the moieties, ligands, and linkers, as well methods and strategies for the 
construction, purification, detection and use of fusion proteins are disclosed in the 

15 literature (Nilsson J ef a/., 1997; Methods Enzymol, Vol. 326-328, Academic Press, 
2000). The preferred one or more protein sequences which can be comprised in the 
fusion proteins belong to these protein sequences: membrane -bound protein, 
immunoglobulin constant region, multimerization domains, extracellular proteins, signal 
peptide-containing proteins, export signal -containing proteins. Features of these 

20 sequences and their specific uses are disclosed in a detailed manner, for example, for 
albumin fusion proteins (WO 01/77137), fusion proteins including multimerization 
domain (WO 01/02440, WO 00/24782), immunoconjugates (Garnett MC, 2001), or 
fusion protein providing additional sequences which can be used for purifying the 
recombinant products by affinity chromatography (Constans A, 2002; Burgess RR and 

25 Thompson NE, 2002; Lowe CR ef a/., 2001; J. Bioch. Biophy. Meth., vol. 49 (1-3), 
2001; Sheibani N, 1999). 

The polypeptides of the invention can be used to generate and characterize ligands 
binding specifically to them. These molecules can be natural or artificial, very different 
from the chemical point of view (binding proteins, antibodies, molecularly imprinted 
30 polymers), and can be produced by applying the teachings in the art (WO 02/74938; 
Kuroiwa Y ef a/., 2002; Haupt K, 2002; van Dijk MA and van de Winkel JG, 2001; 
Gavilondo JV and Larrick JW, 2000). Such ligands can antagonize or inhibit the 
Preadipocyte factor-1 -like activity of the polypeptide against which they have been 
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generated. In particular, common and efficient ligands are represented by extracellular 
domain of a membrane-bound protein or antibodies, which can be in the form 
monoclonal, polyclonal, humanized antibody, or an antigen binding fragment 
The polypeptides and the polypeptide -based derived reagents described above can be 
5 in alternative forms, according to the desired method of use and/or production, such as 
active conjugates or complexes with a molecule chosen amongst radioactive labels, 
fluorescent labels, biotin. or cytotoxic agents. 

Specific molecules, such as peptide mimetics. can be also designed on the sequence 
and/or the structure of a polypeptide of the invention. Peptide mimetics (also called 
10 peptidomimetics) are peptides chemically modified at the level of amino acid side 
chains, of amino acid chirality. and/or of the peptide backbone. These alterations are 
intended to provide agonists or antagonists of the polypeptides of the invention with 
improved preparation, potency and/or pharmacokinetics features. 
For example, when the peptide is susceptible to cleavage by peptidases following 
15 injection into the subject is a problem, replacement of a particularly sensitive peptide 
bond with a non-cleavable peptide mimetic can provide a peptide more stable and thus 
more useful as a therapeutic. Similarly, the replacement of an L -amino acid residue is a 
standard way of rendering the peptide less sensitive to proteolysis, and finally more 
similar to organic compounds other than peptides. Also useful are amino-terminal 
20 blocking groups such as t-butyloxycarbonyl. acetyl, theyl, succinyl, methoxysuccinyl. 
suberyl, adipyl, azelayl. dansyl. benzyloxycarbonyl, fluorenylmethoxycarbonyl. 
methoxyazelayl, methoxyadipyl. methoxysuberyl. and 2,4-dinitrophenyl. Many other 
modifications providing increased potency, prolonged activity, easiness of purification, 
and/or increased half-life are disclosed in the prior art (WO 02/10195; Villain M et a/., 
25 2001). 

Preferred alternative, synonymous groups for amino acids derivatives included in 
peptide mimetics are those defined in Table II. A non -exhaustive list of amino acid 
derivatives also include aminoisobutyric acid (Aib). hydroxyproline (Hyp). 1.2,3.4- 
tetrahydro-isoquinoline-3-COOH, indo!ine-2carboxylic acid, 4-difluoro-proline. L- 
thiazolidine-4-carboxylic acid. L-homoproline, 3.4-dehydro-proline, 3,4-dihydroxy- 
phenylalanine. cyclohexyl-giycine, and phenylglycine. 



30 
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By "amino acid derivative" is intended an amino acid or amino acid -like chemical entity 
other than one of the 20 genetically encoded naturally occurring amino acids. In 
particular, the amino acid derivative may contain substituted or non -substituted, linear, 
branched, or cyclic alkyl moieties, and may include one or more heteroatoms. The 
5 amino acid derivatives can be made de novo or obtained from commercial sources 
(Calbiochem-Novabiochem AG, Switzerland; Bachern, USA). 

Various methodologies for incorporating unnatural amino acids derivatives into 
proteins, using both in vitro and in vivo translation systems, to probe and/or improve 
protein structure and function are disclosed in the literature (Dougherty DA, 2000). 
10 Techniques for the synthesis and the development of peptide mimetics, as well as non - 
peptide mimetics. are also well known in the art (Golebiowski A et a/., 2001; Hruby VJ 
and Balse PM, 2000; Sawyer TK, in "Structure Based Drug Design", edited by 
Veerapandian P, Marcel Dekker Inc., pg. 557-663, 1997). 

Another object of the present invention are isolated nucleic acids enco ding for the 
15 polypeptides of the invention having Preadipocyte factor-1-like activity, the 
polypeptides binding to an antibody or a binding protein generated against them, the 
corresponding fusion proteins, or mutants having antagonistic activity as disclo sed 
above. Preferably, these nucleic acids should comprise a DNA sequence selected from 
the group consisting of SEQ ID NO: 1, a polynucleotide defined by nucleotides 122 to 
20 1180 of SEQ ID NO:1, of SEQ ID NO: 7, a polynucleotide defined by nucleotides 1 to 
1 131 of SEQ ID NO:7 t or the complement of either of said DNA sequences. 
Alternatively, the nucleic acids of the invention should hybridize under high stringency 
conditions, or exhibit at least about 85% identity over a stretch of at least about 30 
nucleotides, with a nucleic acid consisting of SEQ ID NO: 1, a polynucleotide defined 
by nucleotides 122 to 1180 of SEQ ID NO:1, of SEQ ID NO: 7, a polynucleotide defined 
by nucleotides 1 to 1131 of SEQ ID NO:7, or the complement of either of said DNA 
sequences. 

The wording "high stringency conditions" refers to conditions in a hybridization reaction 
that facilitate the association of very similar molecules and consist in the overnight 
30 incubation at 60-65°C in a solution comprising 50 % formamide, 5X SSC (150 m M 
NaCI, 15 m M trisodium citrate), 50 mM sodium phosphate (p H 7 6), 5x Denhardt's 
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solution, 10 % dextran sulphate, and 20 microgram/ml denatured, sheared salmon 
sperm DNA, followed by washing the filters in 0.1X SSC at the same temperature. 
These nucleic acids, including nucleotide sequences substantially the same, can be 
comprised in plasmids, vectors and any other DNA construct which can be used for 
5 maintaining, modifying, introducing, or expressing the encoding polypeptide. In 
particular, vectors wherein said nucleic acid molecule is operatively linked to 
expression control sequences can allow expression in prokaryotic or eukaryotic host 
cells of the encoded polypeptide. 

The wording "nucleotide sequences substantially the same" includes all other nucleic 
10 acid sequences which, by virtue of the degeneracy of the genetic code, also code for 
the given amino acid sequences. In this sense, the literature provides indications on 
preferred or optimized codons for recombinant expression (Kane JF ef a/.. 1995). 
The nucleic acids and the vectors can be introduced into cells with different purposes, 
generating transgenic cells and organisms. A process for producing cells capable of 
15 expressing a polypeptide of the invention comprises genetically engineering cells with 
such vectors and nucleic acids. 

In particular, host cells (e.g. bacterial cells) can be modified by transformation for 
allowing the transient or stable expression of the polypeptides encoded by the nucleic 
acids and the vectors of the invention. Alternatively, said molecules can be used to 
20 generate transgenic animal cells or non-human animals (by non- / homologous 
recombination or by any other method allowing their stable integration and 
maintenance), having enhanced or reduced expression levels of the polypeptides of 
the invention, when the level is compared with the normal expression levels. Such 
precise modifications can be obtained by making use of the nucleic acids of the 
25 inventions and of technologies associated, for example, to gene therapy (Meth. 
Enzymol., vol. 346. 2002) or to site -specific recombinases (Kolb AF. 2002). Model 
systems based on the Preadipocyte factor-1-like polypeptides disclosed in the present 
patent application for the systematic study of their function can be also generated by 
gene targeting into human cell lines (Bunz F, 2002). 
30 Gene silencing approaches may also be undertaken to down -regulate endogenous 
expression of a gene encoding a polypeptide of the invention. RNA interference (RNAi) 
(Elbashir, SM ef a/.. Nature 2001. 411. 494-498) is one method of sequence specific 
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post-transcriptional gene silencing that may be employed. Short dsRNA 
oligonucleotides are synthesised in vitro and introduced into a cell. The sequence 
specific binding of these dsRNA oligonucleotides triggers the d egradation of target 
mRNA, reducing or ablating target protein expression. 

Efficacy of the gene silencing approaches assessed above may be assessed through 
the measurement of polypeptide expression (for example, by Western blotting), and at 
the RNA level using TaqMan-based methodologies. 

The polypeptides of the invention can be prepared by any method known in the art, 
including recombinant DNA-related technologies, and chemical synthesis technologies. 
In particular, a method for making a polypeptide of the invention may comprise 
culturing a host or transgenic cell as described above under conditions in which the 
nucleic acid or vector is expressed, and recovering the polypeptide encoded by said 
nucleic acid or vector from the culture. For example, when the vector expresses the 
polypeptide as a fusion protein with an extracellular or signal -peptide containing 
15 proteins, the recombinant product can be secreted in the extracellular space, and can 
be more easily collected and purified from cultured cells in view of further processing 
or, alternatively, the cells can be directly used or administered. 

The DNA sequence coding for the proteins of the invention can be inserted and ligated 
into a suitable episomal or non- / homologously integrating vectors, which can be 

20 introduced in the appropriate host cells by any suitable means (transformation, 
transfection, conjugation, protoplast fusion, electroporation, calcium phosphate - 
precipitation, direct microinjection, etc.). Factors of importance in selecting a particular 
plasmid or viral vector include: the ease with which recipient cells that contain the 
vector, may be recognized and selected from those recipient cells which do not contain 

25 the vector, the number of copies of the vector which are desired in a particular host; 
and whether it is desirable to be able to "shuttle" the vector between host cells of 
different species. 

The vectors should allow the expression of the isolated or fusion protein including the 
polypeptide of the invention in the Prokaryotic or Eukaryotic host cells under the control 
30 of transcriptional initiation / termination regulatory sequences, which are chosen to be 
constitutively active or inducible in said cell. A cell line substantially enriched in such 
cells can be then isolated to provide a stable cell line. 
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For Eukaryotic hosts (e.g. yeasts, insect, plant, or mammalian cells), different 
transcriptional and translations regulatory sequences may be employed, depending on 
the nature of the host They may be derived form viral sources, such as adenovirus, 
bovine papilloma virus, Simian virus or the like, where the regulatory signals are 
5 associated with a particular gene which has a high level of expression. Examples are 
the TK promoter of the Herpes virus, the SV40 early promoter, the yeast ga!4 gene 
promoter, etc. Transcriptional initiation regulatory signals may be selected which allow 
for repression and activation, so that expression of the genes can be modulated. The 
cells stably transformed by the introduced DNA can be selected by introducing one or 
10 more markers allowing the selection of host cells which contain the expression vector. 
The marker may also provide for phototrophy to an auxotropic host, biocide resistance, 
e.g. antibiotics, or heavy metals such as copper, or the like. The selectable marker 
gene can either be directly linked to the DNA gene sequences to be expressed, or 
introduced into the same cell by co-transfection. 
15 Host cells may be either prokaryotic or eukaryotic. Preferred are eukaryotic hosts, e.g. 
mammalian cells, such as human, monkey, mouse, and Chinese Hamster Ovary 
(CHO) cells, because they provide post-translationai modifications to proteins, 
including correct folding and glycosylation. Also yeast cells can carry out post- 
translational peptide modifications including glycosylation. A number of recombinant 
20 DNA strategies exist which utilize strong promoter sequences and high copy number of 
plasmids which can be utilized for production of the desired proteins in yeast. Yeast 
recognizes leader sequences in cloned mammalian gene products and secretes 
peptides bearing leader sequences (i.e., pre -peptides). 

The above mentioned embodiments of the invention can be achieved by combining the 
25 disclosure provided by the present patent application on the sequence of novel 
Preadipocyte factor-1-iike polypeptides with the knowledge of common molecular 
biology techniques. 

Many books and reviews provides teachings on how to clone and produce recombinant 
proteins using vectors and Prokaryotic or Eukaryotic host cells, such as some titles in 
30 the series "A Practical Approach 0 published by Oxford University Press ("DNA Cloning 
2: Expression Systems", 1995; "DNA Cloning 4: Mammalian Systems", 1996; "Protein 
Expression", 1999; "Protein Purification Techniques", 2001). 
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Moreover, updated and more focused literature provides an overview of the 
technologies for expressing polypeptides in a high -throughput manner (Chambers SP, 
2002; Coleman TA, et a/., 1997), of the cell systems and the processes used 
industrially for the large-scale production of recombinant proteins having therapeutic 

5 applications (Andersen DC and Krummen L, 2002, Chu L and Robinson DK, 2001), 
and of alternative eukaryotic expression systems for expressing the polypeptide of 
interest, which may have considerable potential for the economic production of the 
desired protein, such the ones based on transgenic plants (Giddings G, 2001) or the 
yeast Pichia pastoris (Lin Cereghino GP ef a/., 2002). Recombinant protein products 

10 can be rapidly monitored with various analytical technologies during purification to 
verify the amount and the quantity of the expressed polypeptides (Baker KN et a/., 
2002), as well as to check if there is problem of bioequivalence and immunogenicity 
(Schellekens H, 2002; Gendel SM, 2002). 

Totally synthetic Preadipocyte factor-1-iike polypeptides are disclosed in the literature 
15 and many examples of chemical synthesis technologies, which can be effectively 
applied for the Preadipocyte factor-1-like polypeptides of the invention given their short 
length, are available in the literature, as solid phase or liquid phase synthesis 
technologies, for example, the amino acid corresponding to the carboxy -terminus of the 
peptide to be synthesized is bound to a support which is insoluble in organic solvents, 
20 and by alternate repetition of reactions, one wherein amino acids with their amino 
groups and side chain functional groups protected with appropriate protective groups 
are condensed one by one in order from the carboxy-terminus to the amino-terminus, 
and one where the amino acids bound to the resin or the protective group of the amino 
groups of the peptides are released, the peptide chain is thus extended in this manner. 
25 Solid phase synthesis methods are largely classified by the tBoc method and the Fmoc 
method, depending on the type of protective group used. Typically used protective 
groups include tBoc (t-butoxycarbonyl), Cl-Z (2-chlorobenzyloxycarbonyl), Br-Z (2- 
bromobenzyloxycarbonyl), Bzl (benzyl), Fmoc (9-fluorenylmethoxycarbonyl), Mbh (4,4*- 
dimethoxydibenzhydryl), Mtr (4-methoxy-2,3,6-trimethylbenzenesulphonyl), Trt (trityl), 
30 Tos (tosyl), Z (benzyloxycarbonyl) and CI2-Bzl (2,6-dichlorobenzyl) for the amino 
groups; N02 (nitro) and Pmc (2,2,5,7 1 8-pentamethylchromane-6-sulphonyl) for the 
guanidino groups); and tBu (t-butyl) for the hydroxyl groups). After synthesis of the 
desired peptide, it is subjected to the de -protection reaction and cut out from the solid 
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support. Such peptide cutting reaction may be carried with hydrogen fluoride or tri - 
fluoromethane sulfonic acid for the Boc method, and with TFA for the Fmoc method. 

The purification of the polypeptides of the invention can be carried out by any one of 
the methods known for this purpose, Le. any conventional procedure involving 

5 extraction, precipitation, chromatography, electrophoresis, or the like. A further 
purification procedure that may be used in preference for purifying the protein of the 
invention is affinity chromatography using monoclonal antibodies or affinity groups, 
which bind the target protein and which are produced and immobilized on a gel matrix 
contained within a column. Impure preparations containing the proteins are passed 

10 through the column. The protein will be bound to the column by heparin or by the 
specific antibody while the impurities will pass through. After washing, the protein is 
eluted from the gel by a change in pH or ionic strength. Alternatively, HPLC (High 
Performance Liquid Chromatography) can be used. The elution can be carried using a 
water-acetonitrile-based solvent commonly employed for protein purification. 

15 The disclosure of the novel polypeptides of the invention, and the reagents disclosed in 
connection to them (antibodies, nucleic acids, cells) allows also to screen and 
characterize compounds that enhance or reduce their expression level into a cell or in 
an animal. 

"Oligonucleotides" refers to either a single stranded polydeoxynucleotide or two 
20 complementary polydeoxynucleotide strands which may be chemically synthesized. 
Such synthetic oligonucleotides have no 5" phosphate and thus will not ligate to 
another oligonucleotide without adding a phosphate with an ATP in the presence of a 
kinase. A synthetic oligonucleotide will ligate to a fragment that has not been 
dephosphorylated. 

25 The invention includes purified preparations of the compounds of the invention 
(polypeptides, nucleic acids, cells, etc.). Purified preparations, as used herein, refers to 
the preparations which contain at least 1%, preferably at least 5%, by dry weight of the 
compounds of the invention . 

The present patent application discloses a series of novel Preadipocyte factor-14ike 
30 polypeptides and of related reagents having several possible applications. In particular, 
whenever an increase in the Preadipocyte factor-1-iike activity of a polypeptide of the 
invention is desirable in the therapy or in the prevention of a disease, reagents such as 
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the disclosed Preadipocyte factor-1-like polypeptides, the corresponding fusion 
proteins and peptide mimetics, the encoding nucleic acids, the expressing cells, or the 
compounds enhancing their expression can be used. 

Therefore, the present invention discloses pharmaceutical compositions for the 
5 treatment or prevention of diseases needing an increase in the Preadipocyte factor-1- 
like activity of a polypeptide of the invention, which contain one of the disclosed 
Preadipocyte factor-1-like polypeptides, the corresponding fusion proteins and peptide 
mimetics, the encoding nucleic acids, the expressing cells, or the compounds 
enhancing their expression, as active ingredient. The process for the preparation of 
10 these pharmaceutical compositions comprises combining the disclosed Preadipocyte 
factor-1-like polypeptides, the corresponding fusion proteins and peptide mimetics, the 
encoding nucleic acids, the expressing cells, or the compounds enhancing their 
expression, together with a pharmaceutical^ acceptable carrier. Methods for the 
treatment or prevention of diseases needing an increase in the Preadipocyte factor -1- 
15 like activity of a polypeptide of the invention, comprise the administration of a 
therapeutically effective amount of the disclosed Preadipocyte factor-1-like 
polypeptides, the corresponding fusion proteins and peptide mimetics, the encoding 
nucleic acids, the expressing cells, or the compounds enhancing their expression. 
Amongst the reagents disclosed in the present patent application, the ligands, the 
20 antagonists or the compounds reducing the expression or the activity of polypeptides of 
the invention have several applications, and in particular they can be used in the 
therapy or in the diagnosis of a disease associated to the excessive Preadipocyte 
factor-1-like activity of a polypeptide of the invention. 

Therefore, the present invention discloses pharmaceutical compositions for the 
25 treatment or prevention of diseases associate d to the excessive Preadipocyte factor-1 - 
like activity of a polypeptide of the invention, which contain one of the ligands, 
antagonists, or compounds reducing the expression or the activity of such 
polypeptides, as active ingredient. The process for the preparation of these 
pharmaceutical compositions comprises combining the ligand, the antagonist, or the 
30 compound, together with a pharmaceutical^ acceptable carrier. Methods for the 
treatment or prevention of diseases associated to the excessive Preadipocyte factor-1- 
like activity of the polypeptide of the invention, comprise the administration of a 
therapeutically effective amount of the antagonist, the ligand or of the compound. 
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The pharmaceutical compositions of the invention may contain, in addition to 
Preadipocyte factor-1-like polypeptide or to the related reagent, suitable 
pharmaceutical^ acceptable carriers, biologically compatible vehicles and additives 
which are suitable for administration to an animal (for example, physiological saline) 
5 and eventually comprising auxiliaries (like excipients, stabilizers, adjuvants, or diluents) 
which facilitate the processing of the active compound into preparations which can be 
used pharmaceutical^. 

The pharmaceutical compositions may be formulated in any acceptable way to meet 
the needs of the mode of administration. For example, of biomaterials, sugar- 
10 macromolecule conjugates, hydrogels, polyethylene glycol and other natural or 
synthetic polymers can be used for improving the active ingredients in terms of drug 
delivery efficacy. Technologies and models to validate a specific mode of 
administration are disclosed in literature (Davis BG and Robinson MA, 2002; Gupta P 
et a/., 2002; Luo B and Prestwich GD, 2001; Cleland JL ef a/., 2001; Pillai O and 
15 Panchagnula R, 2001 ). 

Polymers suitable for these purposes are biocompatible, namely, they are non -toxic to 
biological systems, and many such polymers are known. Such polymers may be 
hydrophobic or hydrophilic in nature, biodegradable, non -biodegradable, or a 
combination thereof. These polymers include natural polymers (such as collagen, 
20 gelatin, cellulose, hyaluronic acid), as well as synthetic polymers (such as polyesters, 
polyorthoesters, polyanhydrides). Examples of hydrophobic non -degradable polymers 
include polydim ethyl siloxanes, polyurethanes, polytetrafluoroethylenes, polyethylenes, 
polyvinyl chlorides, and polymethyl methaerylates. Examples of hydrophilic non- 
degradable polymers include poly(2-hydroxyethyl methacrylate), polyvinyl alcohol, 
25 poly(N-vinyl pyrrolidone), polyalkylenes, polyacrylamide, and copolymers thereof. 
Preferred polymers comprise as a sequential repeat unit ethylene oxide, such as 
polyethylene glycol (PEG). 

Any accepted mode of administration can be used and determined by those skilled in 
the art to establish the desired blood levels of the active ingredients. For example, 
30 administration may be by various parenteral routes such as subcutaneous, 
intravenous, intradermal, intramuscular, intraperitoneal, intranasal, transdermal, oral, or 
buccal routes. The pharmaceutical compositions of the present invention can also be 
administered in sustained or controlled release dosage forms, including depot 
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injections, osmotic pumps, and the like, for the prolonged administration of the 
polypeptide at a predetermined rate, preferably in unit dosage forms suitable for single 
administration of precise dosages. 

Parenteral administration can be by bolus injection or by gradual perfusion over time. 
5 Preparations for parenteral administration include sterile aqueous or non-aqueous 
solutions, suspensions, and emulsions, which may contain auxiliary agents or 
excipients known in the art, and can be prepared according to routine methods. In 
addition, suspension of the active compounds as appropriate oily injection suspensi ons 
may be administered. Suitable lipophilic solvents or vehicles include fatty oils, for 
10 example, sesame oil, or synthetic fatty acid esters, for example, sesame oil, or 
synthetic fatty acid esters, for example, ethyl oleate or triglycerides. Aqueous injection 
suspensions that may contain substances increasing the viscosity of the suspension- 
include, for example, sodium carboxymethyl cellulose, sorbitol, and/or dextran. 
Optionally, the suspension may also contain stabilizers. Pharmaceutical compositions 
15 include suitable solutions for administration by injection, and contain from about 0.01 to 
99.99 percent, preferably from about 20 to 75 percent of active compound together with 
the excipient 

The wording "therapeutically effective amount" refers to an amount of the active 
ingredients that is sufficient to affect the course and the severity of the disease, leading 
20 to the reduction or remission of such pathology. The effective amount will depend on 
the route of administration and the condition of the patient. 

The wording "pharmaceutical^ acceptable" is meant to encompass any earner, which 
does not interfere with the effectiveness of the biological activity of the active ingredient 
and that is not toxic to the host to which is administered. For example, for parenteral 

25 administration, the above active ingredients may be formulated in unit dosage form for 
injection in vehicles such as saline, dextrose solution, serum albumin and Ringers 
solution. Carriers can be selected also from starch, cellulose, talc, glucose, lactose, 
sucrose, gelatin, malt, rice, flour, chalk, silica gel, magnesium stearate, sodium 
stearate, glycerol monostearate, sodium chloride, dried skim milk, glycerol, propylene 

30 glycol, water, ethanol, and the various oils, including those of petrole urn, animal, 
vegetable or synthetic origin (peanut oil, soybean oil, mineral oil, sesame oil). 
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lt is understood that the dosage administered will be dependent upon the age, sex, 
health, and weight of the recipient, kind of concurrent treatment, if any, frequency of 
treatment, and the nature of the effect desired. The dosage will be tailored to the 
individual subject, as is understood and determinable by one of skill in the art The total 

5 dose required for each treatment may be administered by multiple doses or in a single 
dose. The pharmaceutical composition of the present invention may be administered 
alone or in conjunction with other therapeutics directed to the condition, or directed to 
other symptoms of the condition. Usually a daily dosage of active ingredient is 
comprised between 0.01 to 100 milligrams per kilogram of body weight per day. 

10 Ordinarily 1 to 40 milligrams per kilogram per day given in divided doses or in 
sustained release form is effective to obtain the desired results. Second or subsequent 
administrations can be performed at a dosage, which is the same, less than, or greater 
than the initial or previous dose administered to the individual. 

Apart from methods having a therapeutic or a production purpose, several other 
15 methods can make use of the Preadipocyte factor-1-like polypeptides and of the 
related reagents disclosed in the present patent application. 

In a first example, a method is provided for screening candidate compounds effective 
to treat a disease related to a Preadipocyte factor-1-like polypeptide of the invention, 
said method comprising: 
20 (a)contacting host cells expressing such polypeptide, transgenic non -human 

animals, or transgenic animal cells having enhanced or reduced expression 
levels of the polypeptide, with a candidate compound and 
(b) determining the effect of the compound on the animal or on the cell. 

In a second example there is provided a method for identifying a candidate compound 
25 as an antagonist/inhibitor or agonist/activator of a polypeptide of the invention, the 
method comprising: 

(a) contacting the polypeptide, the compound, and a mammalian cell or a 
mammalian cell membrane; and 

(b) measuring whether the molecule blocks or enhances the interaction of the 
30 polypeptide, or the response that results from such interaction, with the 

mammalian cell or the mammalian cell membrane. 
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In a third example, a method for determining the activity and/or the presence of the 
polypeptide of the invention in a sample, can detect either the polypeptide or the 
encoding RNA/DNA. Thus, such a method comprises: 
(a) providing a protein -containing sample; 
5 (b) contacting said sample with a ligand of the invention; and 

(c) determining the presence of said ligand bound to said polypeptide, thereby 
determining the activity and/or the presence of polypeptide in said sample. 

In an alternative, the method comprises: 

(a) providing a nucleic acids-containing sample; 
10 (b) contacting said sample with a nucleic acid of the invention; and 

(c) determining the hybridization of said nucleic acid with a nucleic acid into the 
sample, thereby determining the presence of the nucleic acid in the sample. 

In this sense, a primer sequence derived from the nucleotide sequence presented in 
SEQ ID NO:1 or SEQ ID NO:7 can be used as well for determining the presence or the 
15 amount of a transcript or of a nucleic acid encoding a polypeptide of invention in a 
sample by means of Polymerase Chain Reaction amplification. 

A further object of the present invention are kits for measuring the activity and/or the 
presence of Preadipocyte factor-1 -like polypeptide of the invention in a sample 
comprising one or more of the reagents disclosed in the present patent application: a 
20 Preadipocyte factor-1 -like polypeptide of the invention, an antagonist, ligand or peptide 
mimetic, an isolated nucleic acid or the vector, a pharmaceutical composition, an 
expressing cell, or a compound increasing or decreasing the expression levels. 
Such kits can be used for in vitro diagnostic or screenings methods, and their actual 
composition should be adapted to the specific format of the sample (e.g. biological 

25 sample tissue from a patient), and the molecular species to be measured. For example, 
if it is desired to measure the concentration of the Preadipocyte factor-1 -like 
polypeptide, the kit may contain an antibody and the corresponding protein in a purified 
form to compare the signal obtained in Western blot. Alternatively, if it is desired to 
measure the concentration of the transcript for the Preadipocyte factor-1 -like 

30 polypeptide, the kit may contain a specific nucleic acid probe designed on the 
corresponding ORF sequence, or may be in the form of nucleic acid array containing 
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such probe. The kits can be also in the form of protein-, peptide mimetic-, or cell-based 
microarrays (Templin MF et a/., 2002; Pellois JP et a/., 2002; Blagoev B and Pandey A, 
2001), allowing high-throughput proteomics studies, by making use of the proteins, 
peptide mimetics and ceils disclosed in the present patent application. 
5 Therapeutic Uses 

SCS0009 nucleic acid molecules, polypeptides, and agonists and antagonists thereof 
can be used to treat, diagnose, ameliorate, or prevent a number of diseases, disorders, 
or conditions, including those recited herein. 

SCS0009 polypeptide agonists and antagonists include those molecules which 
10 regulate SCS0009 polypeptide activity and either increase or decrease at least one 
activity of the mature form of the SCS0009 polypeptide. Agonists or antagonists may 
be co-factors, such as a protein, peptide, carbohydrate, lipid, or small molecular weight 
molecule, which interact with SCS0009 polypeptide and thereby regulate its activity. 

Potential polypeptide agonists or antagonists include antibodies that react with either 
15 soluble or membrane -bound forms of SCS0009 polypeptides that comprise part or ail 
of the extracellular domains of the said proteins. Molecules that regulate SCS0009 
polypeptide expression typically include nucleic acids encoding SCS0009 polypeptide 
that can act as anti- sense regulators of expression. 

The present patent application discloses novel Preadipocyte factor-1-like polypeptides 
20 and a series of related reagents that may be useful, as active ingredients in 
pharmaceutical compositions appropriately formulated, in the treatment or prevention 
of diseases and conditions in which Preadipocyte factor-1-like proteins are implicated, 
such as obesity, organomegaly, diabetes (including insulin resistance, 
hyperinsulinemia, hyperglycemia, hypertriglyceridemia), adrenocortical dysfunction, 

25 hypertrophy of cardiac and skeletal muscle, lipodystrophy (see OMIM*269700), and 
immune system disorders including autoimmune diseases and immunodeficiencies 
(Jensen et al. showed that DLK1 is expressed in the CNS. Neuroreport. 2001 Dec 
21;12(18):3959~63), developmental defects, cancer, autoimmune thyroid diseases and 
related disorders such as opthaimopathies and other pathological conditions in which 

30 Preadipocyte factor-1 protein is implicated. 

Moon et al. Show that Pref-1 shows growth retardation, obesity, blepharophimosis, 
skeletal malformation, and increased serum lipid metabolites (Yang Soo moon et al. 
Moleculat and Cellular Biology. 2002. Vol.22(15):5585-5592). As such SCS0009 
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nucieic acid molecules, polypeptides, agonists and antagonists thereof may be useful 
in diagnosing or treating growth retardation, obesity, blepharophimosis, skeletal 
malformation, and decrease serum lipid metabolites. 

Murphy et al. indicate that DLK1 might contribute to the discordant phenotypes 
5 associated with uniparetnal disomy (UDP) of chromosome 14 (Murphy SK et al. Hum. 
Mutat 2003 Ju1;22(1):92-7.). Furthermore, the paper of Sutton et al. suggests that 
DLK1 might be involved in the onset of mental retardation and maternal uniparental 
disomy (Am J Med Genet 2002 Sep 15;112(1):23-7). In addition, Kobayashi et al. point 
out that DLK1 may contribute to the scoliosis phenotype observed in UDP (Kobayashi 
10 S et al. Gene Cells. 2000 Dec;5(12):1029-37). As such SCS0009 nucleic acid 
molecules, polypeptides, and agonists and antagonists thereof may be useful in 
diagnosing or treating mental retardation, maternal uniparental disomy or scoliosis. 

Van Limpt showed that DLK1 is highly expressed in a subset of neuroblastoma cell 
lines (Int. J Cancer. 2003 May 20;105(1):61 -9; see also van Limpt V et al. Med Pediatr 

15 Oncol. 2000 Dec;35(6):554-8; and Online Mendeiian Inheritance in Man™ 
(OMIM)*1 76290: httD:/Awvw.ncbi.nlm.nih-QOv/entrez/auerv.fcQi ?db=OMIM). As such 
SCS0009 nucleic acid molecules, polypeptides, agonists and antagonists thereof may 
be useful in diagnosing or treating neuroblastoma, pheochromocytoma, lung tumors, 
neuroendocrine tumors. 

20 Dogget et al. state that DLK1 is essential for normal hematopoieisis and abnormal 
expression is a proposed marker of myelodysplastic syndrome (Dogget KL et al. J Cell. 
Bioichem. 2002;86(1):56-66). As such SCS0009 nucleic acid molecules, polypeptides, 
agonists and antagonists thereof may be useful in diagnosing or treating 
myelodysplastic syndrome. 

25 The therapeutic applications of the polypeptides of the invention an d of the related 
reagents can be evaluated (in terms or safety, pharmacokinetics and efficacy) by the 
means of the in vivo I in vitro assays making use of animal cell, tissues and or by the 
means of in siiico I computational approaches (Johnson DE and Wolfgang GH, 2000), 
known for the validation of Preadipocyte factor-1-like polypeptides and other biological 

30 products during drug discovery and preclinical development. 

The invention will now be described with reference to the specific embodiments by 
means of the following Examples, which should not be construed as in any way limiting 
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the present invention. The content of the description comprises all modifications and 
substitutions which can be practiced by a person skilled in the art in light of the above 
teachings and, therefore, without extending beyond the meaning and purpose of the 
claims. 



5 
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TABLE I 



Amino Acid 


Synonymous Groups 


More Preferred Synonymous Groups 


Ser 


Gly, Ala, Ser, Thr, Pro 


Thr, Ser 


Arg 


Asn, Lys, Gin, Arg, His 


Arg, Lys, His 


Leu 


Phe, He, Val, Leu, Met 


lie, Val, Leu, Met 


Pro 


Gly, Ala, Ser, Thr. Pro 


Pro 


Thr 


Gly, Ala, Ser. Thr, Pro 


Thr, Ser 


Ala 


Gly, Thr, Pro, Ala, Ser 


Gly, Ala 


Val 


Met, Phe, lie, Leu, Val 


Met, lie, Val, Leu 


Gly 


Ala, Thr, Pro, Ser, Gly 


Gly, Ala 


lie 


Phe. He, Val, Leu, Met 


He, Val, Leu, Met 


Phe 


Trp, Phe.Tyr 


Tyr, Phe 


Tyr 


Trp, Phe.Tyr 


Phe, Tyr 

^ ■ — . — — — 


Cys 


Ser, Thr, Cys 


Cys 


His 


Asn, Lys. Gin, Arg, His 


Arg, Lys, His 


Gin 


Glu, Asn, Asp, Gin 


Asn, Gin 


Asn 


Glu, Asn, Asp, Gin 


Asn, Gin 


Lys 


Asn, Lys, Gin, Arg, His 


Arg, Lys, His 


Asp 


Glu, Asn, Asp, Gin 


Asp, Glu 


Glu 


Glu, Asn, Asp, Gin 


Asp, Glu 


Met 


Phe, lie, Val, Leu, Met 


He, Val, Leu, Met 


Trp 


Trp, Phe,Tyr 


Trp 
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TABLE II 



Amino Acid 


Synonymous Groups 


Ser 


D-Ser, Thr, D-Thr, allo-Thr, Met, D-Met, Met(O), D-Met(O), L-Cys, 
D-Cys 


Arg 


D-Arg, Lys, D-Lys, homo-Arg, D-homo-Arg, Met, lie, D-.Met, D- 
lle, Orn, D-Om 


Leu 


D-Leu, Val, D-Val, AdaA, AdaG, Leu, D-Leu f Met, D-Met 


Pro 


D-Pro, L-l-thioazolidine-4-carboxylicacid, D-or L-1-oxazolidine~4- 
carboxylic acid 


Thr 


D-Thr, Ser, D-Ser, allo-Thr, Met,D-Met Met(O), D-Met(O), Val, D- 
Val 


Ala 


D-Ala, Gly, Aib, B-Ala, Acp, L-Cys, D-Cys 


Val 


D-Val, Leu, D-Leu, lie, D-Ile, Met, D-Met, AdaA, AdaG 


Gly 


Ala, D-Ala, Pro, D-Pro, Aib, .beta. -Ala, Acp 


lie 


D-lle, Val, D-Val, AdaA, AdaG, Leu, D-Leu, Met, D-Met 


Phe 


D-Phe, Tyr, D-Thr, L-Dopa, His, D-His, Trp, D-Trp, Trans-3,4, or 
5-phenylprohne, AdaA, AdaG, cis-3,4, or 5-phenylprohne, bpa, u- 
Bpa 


Tyr 


D-Tyr, Phe, D-Phe, L-Dopa, His, D-His 


Cys 


D-Cys, S-Me-Cys, Met, D-Met, Thr, D-Thr 


Gin 


D-GIn, Asn, D-Asn, Glu, D-Glu, Asp, D-Asp 


Asn 


D-Asn, Asp, D-Asp, Glu, D-Glu, Gin, D-GIn 


Lys 


D-Lys, Arg, D-Arg, homo-Arg, D-homo-Arg, Met, D-Met, lie, D-lle, 
Orn, D-Orn 


Asp 


D-Asp, D-Asn, Asn, Glu, D-Glu, Gin, D-GIn 


I Glu 


D-Glu, D-Asp, Asp, Asn, D-Asn, Gin, D-GIn 


I Met 


D-Met, S-Me-Cys, lie, D-lle, Leu, D-Leu, Val, D-Val 
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EXAMPLES 

Example 1 : 

Sequences of EGF protein domains from the ASTRAL database (Brenner SE ef al. 

5 "The ASTRAL compendium for protein structure and sequence analysis" Nucleic Acids 
Res. 2000 Jan 1; 28 (1): 254-6) were used to search for homologous protein 
sequences in genes predicted from human genome sequence (Celera database). The 
protein sequences were obtained from the gene predictions and translations thereof as 
generated by one of three programs: the Genescan (Burge C, Karlin S„ "Prediction of 

10 complete gene structures in human genomic DNA, J Mol Biol. 1997 Apr 25;268(1):78 - 
94) Grail (Xu Y, Uberbacher EC, « Automated gene identification in large -scale 
genomic sequences", J Comput Biol. 1997 Fall;4(3):325-38) and Fgenesh (Proprietary 
Celera software). 

The sequence profiles of the EGF domains were generated using PIMAII (Profile 

15 Induced Multiple Alignment; Boston University software, version II, Das S and Smith TF 
2000), an algorithm that aligns homologous sequences and generates a sequence 
profile. The homology was detected using PIMAII that generates global -local 
alignments between a query profile and a hit sequence. In this case the algorithm was 
used with the profile of the EGF functional domain as a query. PIMAII compares the 

20 query profile to the database of gene predictions translated into protein sequence and 
can therefore identify a match to a DNA sequence that contains that domain. Further 
comparison by BLAST (Basic Local Alignment Search Tool; NCBI vers ion 2) of the 
sequence with known EFG containing proteins identified the closets homolog (Gish W, 
States DJ. "Identification of protein coding regions by database similarity search.", Nat 

25 Genet. 1993 Mar;3(3):266-72; Pearson WR, Miller W., "Dynamic programming 
algorithms for biological sequence comparison.", Methods Enzymol. 1992;210:575-601; 
Altschul SF ef a/., "Basic local alignment search tool", J Mol Biol. 1990 Oct 
5;21 5(3):403-1 0). PIMAII parameters used for the detection were the PIMA prior amino 
acids probability matrix and a Z-cutoff score of 10. BLAST parameters used were: 

30 Comparison matrix = BLOSUM62; word length = 3; .E value cutoff =10; Gap opening 
and extension = default; No filter. 
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Once the functional domain was identified in the sequence, the genes were re- 
predicted with the genewise algorithm using the sequence of the closets homolog 
(Birney E et a/., " PairWise and SearchWise: finding the optimal alignment in a 
simultaneous comparison of a protein profile against all DNA translation frames.", 
5 Nucleic Acids Res. 1996 Jul 15;24(14):2730-9) 

The profiles for homologous EGF domains were generated automatically using the 
PSI-BLAST (Altshul etal 1997) scripts written in PERL (Practical Extraction and Report 
Language) and PIMAII. 

A total of 55 predicted genes out of the 464 matching the original query generated on 
10 the basis of EGF domain profiles were selected since they were judged as potentially 
novel. 

The novelty of the protein sequences was finally assessed by searching protein 
databases (SwissProt/Trembl, Human IPI and Derwent GENESEQ) using BLAST and 
a specific annotation has been attributed on the basis of amino acid sequence 
15 homology. 
Example 2 : 

One sequence isolated by the methodology set out in Example 1 is that referred to 
herein as SCS0009 polypeptide sequence. 

The protein corresponds to a shorter version of longer sequence that is identified in a 
20 number of patent applications, essentially lacking the third of the six EGF repeats 
present in the central region of the protein (see Figure 1). Some fragments were 
identified on the basis of the homology with EGF. 

Amongst these patent applications, WO01 57233 (HYSEQ) describes a protein having 
a homology with Pref-1 of about 50%, and portions homologous to many different 
25 proteins, such as wheat germ agglutinin, laminin, coagulation factors, fibrillin, TNF-RI, 
IGF-R1, and e-selectin. 

Preadipocyte factor-1 (Pref-1, SWISSPROT Q09163; also called Adipocyte 
differentiation inhibitor protein, Delta-like protein, and fetal antigen 1), is a membrane 
protein known to inhibit adipocyte differentiation and mediate GH antiadipogenic 
30 effects. Knock-out mice present growth retardation, obesity, blepharophimosis, skeletal 
malformation, and increased serum lipid metabolites, suggesting that Prf -1 is important 
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for homeostasis of adipose tissue mass. Several alternative splicing variants are 
known, but mostly differ in the C-terminal region, comprising the transmembrane 
/intracellular sequences. 

5 Example 3 - Identification and cloning of SCS0009 and of splice var iants 
SCS0009-SV3 and SCS0009-SV4: 

3.1 Introduction: 

SCS0009 is a 1663 nucleotide cDNA prediction spanning 6 exons, encoding an EGF 
10 domain containing protein of 352 amino acids, with homology to preadipocyte factor - 
1/delta- like protein. SCS0009 is a splice variant of sequences in Celera 
(hCP1782513.1), NAgeneseq (AAH78208), AAgeneseq (ADA06923) and Swissprot 
(AAQ88493). The latter sequences contain an additional 31 amino acids occurring 
between amino acids 90 and 91 of the prediction. The sequence with the 31 amino acid 
15 insertion is called SCS0009-SV6. Two image clones were also identified which appear 
to be splice variants of SCS0009. SCSQ009-SV3 (Image clone 5478078) is identical to 
SV5 except that it contains a 6 amino acid deletion after amino acid 84. SCS0009-SV4 
(Image clone 3349698) is a truncated version of SCS0009-SV5. The alignment of 
these splice variants is shown in figure 2. 
20 In addition to splice variants 3, 4 and 5 we also identified 2 further splice variants which 
lacked the first 62 amino acids of the SCS0009 prediction and hence did not contain 
the predicted signal peptide sequence. SV1 also contained the 31 amino acid insertion 
seen in the other splice variants. SV2 was identical to SV1 except for the deletion of 6 
amino acids upstream of the 31 amino acid insertion. The alignment of SCS0009 with 
25 SV1 and SV2 shown in figure 3. 

Expression constructs for SCS0009 and its splice variants were generated as follows: 
Image clone 5478078 was used as a template to generate a C-terminal 6H1S tagged 
version of SCS0009-SV3. Image clone 3349698 was used as a template to generate a 
C-terminal 6HIS tagged version of SCS0009-SV4. SCS0009-SV5 was engineered from 
30 SV3. SCS0009 was engineered from SV5. 
3.2 Cloning of SCS0009-SV3 

The SCS0009 is a 1663 nucleotide cDNA prediction spanning 6 exons, encoding an 
EGF domain containing protein of 352 amino acids, with homology to preadipocyte 
factor-1 /delta- like protein. An IMAGE clone (5478078), which contained the SCS0009 
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coding sequence (figure 4), was purchased from ATCC (plasmid ID 14670) and 
sequence verified using SP6 and T7 primers (table 1). The cDNA insert in the image 
clone differs from the SCS0009 prediction in that it has a deletion of 18 nucleotides (6 
amino acids) at the end of exon 3. It also contains a 5' extension of 93 nucleotides In 
5 exon 4 (resulting in a 31 amino acid insertion) (figure 5) and so appears to be a splice 
variant of SCS0009 which has been called SCS0009 -SV3 (figure 6) 



3.2.1 Construction of mammalian cell expression vecto rs for SCS00009-SV3 
Plasmid 14670 was used as a PCR template to generate pEAK12d (figure 7) 
and pDEST12.2 (figure 8) expression clones containing the SCS0009-SV3 ORF 
sequence with a 3' sequence encoding a 6HIS tag using the Gateway™ cloning 
methodology (Invitrogen). 

3.2.2 Generation of Gateway compatible SCS0009-SV3 ORF fu sed to an in 
frame 6HIS tag sequence. 

The first stage of the Gateway cloning process involves a two step PCR 
reaction which generates the ORF of SCS0009-SV3 flanked at the 5* end by an 
attB1 recombination site and Kozak sequence, and flanked at the 3* end by a 
sequence encoding an in frame 6 histidine (6HIS) tag, a stop codon and the 
attB2 recombination site (Gateway compatible cDNA). The first PCR reaction 
(in a final volume of 50 contains: 1 \x\ of plasmid 14670, 1.5 \i\ dNTPs (10 
mM), 10 \i\ of 10X Pfx polymerase buffer, 1 |xl MgS04 (50 mM), 0.5 jil each of 
gene specific primer (100 ^M) (SCS0009-SV3-EX1 and SCS0009-SV3-EX2), 
and 0.5 fxl Platinum Pfx DNA polymerase (Invitrogen). The PCR reaction was 
performed using an initial denaturing step of 95 °C for 2 min, followed by 12 
cycles of 94 °C for 15 s; 55 °C for 30 s and 68 °C for 2 min; and a holding cycle 
of 4 °C. The amplification products were visualized on 0.8 % agarose gel in 1 X 
TAE buffer (Invitrogen) and a product migrating at the predicted molecular mass 
was purified from the gel using the Wizard PCR Preps DNA Purification System 
(Promega) and recovered in 50 jil sterile water according to the manufacturer's 
instructions. 

The second PCR reaction (in a final volume of 50 contained 10 pi purified 
PCR 1 product, 1.5 \i\ dNTPs (10 mM), 5 |xl of 10X Pfx polymerase buffer, 1 \x\ 
MgS04 (50 mM), 0.5 \i\ of each Gateway conversion primer (100 \M) (GCP 
forward and GCP reverse) and 0.5 |nl of Platinum Pfx DNA polymerase. The 
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conditions for the 2nd PCR reaction were: 95 °C for 1 min; 4 cycles of 94 °C, 15 
sec; 50 °C, 30 sec and 68 °C for 2 min; 25 cycles of 94 °C, 15 sec; 55 °C , 30 
sec and 68 °C, 2 min; followed by a holding cycle of 4 °C. PCR products were 
gel purified using the Wizard PCR prep DNA purification system (Promega) 
according to the manufacturer's instructions. 

3.2.3 Subclonina of Gateway compatible SC S0009-SV3 ORF into Gateway 
entry vector DDONR221 and expressio n vectors pEAK12d and pDEST12.2 
The second stage of the Gateway cloning process involves subcloning of the 
Gateway modified PCR product into the Gateway entry vector pDONR221 
(Invitrogen, figure 9) as follows: 5 ^l of purified product from PCR2 were 
incubated with 1.5 *iJ pDONR221 vector (0.1 \ig/vd) t 2 |il BP buffer and 1.5 ^ of 
BP clonase enzyme mix (Invitrogen) in a final volume of 10 \i\ at RT for 1 h. The 
reaction was stopped by addition of proteinase K 1 \i\ (2 Mg'M-0 and incubated 
at 37 °C for a further 10 min. An aliquot of this reaction (1 ^l) was used to 
transform E. coli DH10B cells by electroporation as follows: a 25 \i\ aliquot of 
DH10B electrocompetent cells (Invitrogen) was thawed on ice and 1 \i\ of the 
BP reaction mix was added. The mixture was transferred to a chilled 0.1 cm 
electroporation cuvette and the cells electroporated using a BioRad Gene- 
Pulser™ according to the manufacturer's recommended protocol. SOC media 
(0.5 ml), which had been pre-warmed to room temperature, was added 
immediately after electroporation. The mixture was transferred to a 15 mi snap- 
cap tube and incubated, with shaking (220 rpm) fqr 1 h at 37 °C. Aliquots of the 
transformation mixture (10 nl and 50 jil) were then plated on L-broth (LB) plates 
containing kanamycin (40 fig/ml) and incubated overnight at 37 °C. 
Plasmid mini-prep DNA was prepared from 5 ml cultures from 6 of the resultant 
colonies using a Qiaprep Turbo 9600 robotic system (Qiagen). Plasmid DNA 
(150-200 ng) was subjected to DNA sequencing with 21M13 and M13Rev 
primers using the BigDyeTerminator system (Applied Biosystems cat. no. 
4390246) according to the manufacturer's instructions. The primer sequences 
are shown in Table 1. Sequencing reactions were purified using Dye -Ex 
columns (Qiagen) or Montage SEQ 96 cleanup plates (Millipore cat no. 
LSKS09624) then analyzed on an Applied Biosystems 3700 sequencer. 
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Plasmid eluate (2 pi or approx. 150 ng) from one of the clones which contained 
the correct sequence (pENTR-SCS0009-SV3-6HIS, plasmid ID 14879, figure 
10) was then used in a recombination reaction containing 1.5 pi of either 
pEAK12d vector or pDEST12.2 vector (figures 3 & 4) (0.1 M9 / PO, 2 pl LR 
buffer and 1.5 pl of LR cionase (Invitrogen) in a final volume of 10 pl. The 
mixture was incubated at RT for 1 h, stopped by addition of proteinase K (2 MO) 
and incubated at 37 °C for a further 10 min. An aliquot of this reaction (1 ul) was 
used to transform E. coli DH10B cells by electro poration as follows: a 25 pi 
aliquot of DH10B electrocompetent cells (Invitrogen) was thawed on ice and 1 
pi of the LR reaction mix was added. The mixture was transferred to a chilled 
0.1 cm electroporation cuvette and the cells electroporated using a BioRad 
Gene-Pulser™ according to the manufacturer's recommended protocol. SOC 
media (0.5 ml), which had been pre-warmed to room temperature, was added 
immediately after electroporation. The mixture was transferred to a 15 ml snap- 
cap tube and incubated, with shaking (220 rpm) for 1 h at 37 °C. Aliquots of the 
transformation mixture (10 pi and 50 pi) were then plated on L-broth (LB) plates 
containing ampicillin (100 pg/ml) and incubated overnight at 37 °C. 
Plasmid mini-prep DNA was prepared from 5 ml cultures from 6 of the resultant 
colonies subcioned in each vector using a Qiaprep Turbo 9600 robotic system 
(Qiagen). Plasmid DNA (200-500 ng) in the pEAK12d vector was subjected to 
DNA sequencing with pEAK12F, pEAK12R and SCS0009 -SV3-SP1 primers as 
described above. Plasmid DNA (200-500 ng) in the pDEST12.2 vector was 
subjected to DNA sequencing with 21M13 and M13Rev and SCS0009-SV3- 
SP1 primers as described above. Primer sequences are shown in Table 1. 
CsCI gradient purified maxi-prep DNA was prepared from a 500 ml culture of 
one of each of the sequence verified clones (pEAK12d-SCS0009-SV3-6HIS, 
plasmid ID number 14885, figure 11, and pDEST12.2-SCS0009-SV3-6HIS, 
plasmid ID 14889, figure 12) using the method described by Sambrook J. et al., 
1989 (in Molecular Cloning, a Laboratory Manual, 2 nd edition, Cold Spring 
Harbor Laboratory Press), Plasmid DNA was resuspended at a concentration of 
1 pg/pJ in sterile water (or 10 mM Tris-HCI pH 8.5) and stored at -20 °C. 
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Table III SCS0009-SV3 cloning and sequencing primers 



Primer 


Sequence (5' -3') 


SCS0009SV3 -EX1 


AA GCA GGC TTC GCC ACC ATG CCC AGC GGC TGC CGC TG 


SCS0009SV3 -EX2 


GTG ATG GTG ATG GTG CAG TGC TGT GGT CTT TCC AG 


GCP Forward 


C GGG KCfi ITO TAP. AAA AAA GCA GGC TTC GCC ACC 


GCP Reverse 


GGG GAC CAC TTT GTA CAA GAA AGC TGG GTT TCA ATG GTG 
ATG GTG ATG GTG 


SCS0009SV3-SP1 


TGA TGC GGC CTT GTG CTA AC 


pEAK12F 


GCC AGC TTG GCA CTT GAT GT 


PEAK12R 


GAT GGA GGT GGA CGT GTC AG 


21M13 


TGT AAA ACG ACG GCC AGT 


Ml 3 REV 


CAG GAA ACA GCT ATG ACC 


T7 primer 


TAA TAC GAC TCA CTA TAG GG 


SP6 primer 


ATT TAG GTG ACA CTA TAG 



Underlined sequence = Kozak sequence 
Bold = Stop codon 



Italic sequence 



= His tag 



10 3.3 Cloning of SCS0QQ9-SV4 

The SCS0009 is a 1663 nucleotide cDNA prediction spanning 6 exons, encoding an 
EGF domain containing protein of 352 amino acids, with homology to preadipocyte 
factor-1/delta- like protein. An IMAGE clone (3349698), which contained the SCS0009 
coding sequence (figure 13), was purchased from ATCC (plasmid ID 14680) and 

15 sequence verified using SP6 and T7 primers (table 1). The cDNA insert in the image 
clone differs from the SCS0009 prediction in that it contains a 5' extension of 93 
nucleotides in exon 4 of the prediction (resulting in a 31 amino acid insertion). There is 
also a deletion of 133 bp near to the end of exon 4 which leads to a frameshift and 
immediate stop codon producing a truncated version of SCS0009 (figure 14). The 

20 cDNA of the image clone therefore appears to encode a splice variant of SCS0009 
which we have called SCS0009-SV4. The alignment of SCS0009 with the splice variant 

is shown in figure 1 5. 

3.3.1 Construction of mammalian cell exp ression vectors for SCS0Q009-SV4 
Plasmid 14680 was used as a PCR template to generate pEAK12d (figure 16) 
25 and pDEST12.2 (figure 17) expression clones containing the SCS0009-SV4 

ORF sequence with a 3' sequence encoding a 6HIS tag using the Gateway™ 
cloning methodology (Invitrogen). 
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3.3.2 Generation of Gateway compatible SCS0009- SV4 QRF fused to 
an in frame 6H1S tag sequence. 

The first stage of the Gateway cloning process involves a two step PCR 
reaction which generates the ORF of SCS0009-SV4 flanked at the 5' end by an 
attB1 recombination site and Kozak sequence, and flanked at the 3' end by a 
sequence encoding an in frame 6 histidine (6HIS) tag, a stop codon and the 
attB2 recombination site (Gateway compatible cDNA). The first PCR reaction 
(in a final volume of 50 yd) contains: 1 \x\ of plasmid 14680, 1.5 *il dNTPs (10 
mM), 10 \i\ of 10X Pfx polymerase buffer, 1 \x\ MgS04 (50 mM), 0.5 ^ each of 
gene specific primer (100 |iM) (SCS0009-SV4-EX1 and SCS0009-SV4-EX2), 
and 0.5 |xl Platinum Pfx DNA polymerase (Invitrogen). The PCR reaction was 
performed using an initial denaturing step of 95 °C for 2 min, followed by 12 
cycles of 94 °C for 15 s; 55 °C for 30 s and 68 °C for 2 min; and a holding cycle 
of 4 °C. The amplification products were visualized on 0.8 % agarose gel in 1 X 
TAE buffer (Invitrogen) and a product migrating at the predicted molecular mass 
was purified from the gel using the Wizard PCR Preps DNA Purification System 
(Promega) and recovered in 50 fil sterile water according to the manufacturer's 
instructions. 

The second PCR reaction (in a final volume of 50 jil) contained 10 pi purified 
PCR 1 product, 1.5 \i\ dNTPs (10 mM), 5 \i\ of 10X Pfx polymerase buffer, 1 \i\ 
MgS04 (50 mM), 0.5 jil of each Gateway conversion primer (100 ^M) (GCP 
forward and GCP reverse) and 0.5 of Platinum Pfx DNA polymerase. The 
conditions for the 2nd PCR reaction were: 95 °C for 1 min; 4 cycles of 94 °C, 15 
sec; 50 °C, 30 sec and 68 °C for 2 min; 25 cycles of 94 °C, 15 sec; 55 °C , 30 
sec and 68 °C, 2 min; followed by a holding cycle of 4 °C. PCR products were 
gel purified using the Wizard PCR prep DNA purification system (Promega) 
according to the manufacturer's instructions. 

3.3.3 Subclonina of Gateway compatible SCS0009-S V4 ORF into Gateway 
entry vector PDONR221 and expression vectors oEAK1 2d and PDEST12.2 
The second stage of the Gateway cloning process involves subcloning of the 
Gateway modified PCR product into the Gateway entry vector pDONR221 
(Invitrogen, figure 18) as follows: 5 jll! of purified product from PCR2 were 
incubated with 1.5 |Al pDONR221 vector (0.1 jmg/jAl). 2 M-l BP buffer and 1.5 pJ of 
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BP clonase enzyme mix (Invitrogen) in a final volume of 10 jxl at RTfor 1 h. The 
reaction was stopped by addition of proteinase K 1 pi (2 pg/pl) and incubated at 
37 °C for a further 10 min. An aliquot of this reaction (1 p.l) was used to 
transform E. coli DH10B cells by electroporation as follows: a 25 jxl aliquot of 
DH10B electrocompetent cells (Invitrogen) was thawed on ice and 1 pi of the 
BP reaction mix was added. The mixture was transferred to a chilled 0.1 cm 
electroporation cuvette and the cells electroporated using a BioRad Gene- 
Pulser™ according to the manufacturer's recommended protocol. SOC media 
(0.5 ml), which had been pre-warmed to room temperature, was added 
immediately after electroporation. The mixture was transferred to a 15 ml snap- 
cap tube and incubated, with shaking (220 rpm) for 1 h at 37 °C. Aliquots of the 
transformation mixture (10 pi and 50 pi) were then plated on L-broth (LB) plates 
containing kanamycin (40 pg/ml) and incubated overnight at 37 °C. 
Plasmid mini-prep DNA was prepared from 5 ml cultures from 6 of the resultant 
colonies using a Qiaprep Turbo 9600 robotic system (Qiagen). Plasmid DNA 
(150-200 ng) was subjected to DNA sequencing with 21M13 and M13Rev 
primers using the BigDyeTerminator system (Applied Biosystems cat. no. 
4390246) according to the manufacturer's instructions. The primer sequences 
are shown in Table 1. Sequencing reactions were purified using Dye-Ex 
columns (Qiagen) or Montage SEQ 96 cleanup plates (Millipore cat. no. 
LSKS09624) then analyzed on an Applied Biosystems 3700 sequencer. 
Plasmid eluate (2 pi or approx. 150 ng) from one of the clones which contained 
the correct sequence (pENTR-SCS0009-SV4-6HIS, plasmid ID 15055, figure 
19) was then used in a recombination reaction containing 1.5 |xl of either 
pEAK12d vector or pDEST12.2 vector (figures 16 & 17) (0.1 pg / pi). 2 pi LR 
buffer and 1.5 pi of LR clonase (Invitrogen) in a final volume of 10 pi. The 
mixture was incubated at RT for 1 h, stopped by addition of proteinase K (2 pg) 
and incubated at 37 °C for a further 10 min. An aliquot of this reaction (1 ul) was 
used to transform E. coli DH10B cells by electroporation as follows: a 25 pJ 
aliquot of DH10B electrocompetent cells (Invitrogen) was thawed on ice and 1 
pJ of the LR reaction mix was added. The mixture was transferred to a chilled 
0.1 cm electroporation cuvette and the cells electroporated using a BioRad 
Gene-Pulser™ according to the manufacturer's recommended protocol. SOC 
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media (0.5 ml), which had been pre-warmed to room temperature, was added 
immediately after electroporation. The mixture was transferred to a 15 ml snap- 
cap tube and incubated, with shaking (220 rpm) for 1 h at 37 °C. Aliquots of the 
transformation mixture (10 |il and 50 yd) were then plated on L-broth (LB) plates 
containing ampicillin (100 \ig/m\) and incubated overnight at 37 °C. 
Plasmid mini-prep DNA was prepared from 5 ml cultures from 6 of the resultant 
colonies subcloned in each vector using a Qiaprep Turbo 9600 robotic system 
(Qiagen). Plasmid DNA (200-500 ng) in the pEAK12d vector was subjected to 
DNA sequencing with pEAK12F and pEAK12R primers as described above. 
Plasmid DNA (200-500 ng) in the pDEST12.2 vector was subjected to DNA 
sequencing with 21M13 and M13Rev primers as described above. Primer 
sequences are shown in Table 1 . 

CsCl gradient purified maxi-prep DNA was prepared from a 500 ml culture of 
one of each of the sequence verified clones (pEAK12d-SCS0G09-SV4-6HIS, 
plasmid ID number 15061, figure 20, and pDEST12.2-SCS0009-SV4-6HIS, 
plasmid ID 15063, figure 21) using the method described by Sambrook J. et ah, 
1989 (in Molecular Cloning, a Laboratory Manual, 2 nd edition, Cold Spring 
Harbor Laboratory Press), Plasmid DNA was resuspended at a concentration of 
1 \lq!\i\ in sterile water (or 10 mM Tris-HCI pH 8.5) and stored at -20 °C. 



20 
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Table IV SCS0009-SV4 cloning and sequencing primers 



Primer 


Sequence (5' -3') 


SCS0009SV4 -EX1 


AA GCA GGC TTC GCC ACC ATG CCC AGC GGC TGC CGC TG 


SCS0009SV4-EX2 


GTG ATG GTG ATG GTG GGG TCC AGC CTT GCG CTC GC 


GCP Forward 


G GGG ar-n TTfi TAH AAA AAA GCA GGC TTC GCC ACC 


GCP Reverse 


GGG GAC CAC TTT GTA CAA GAA AGC TGG GTT TCA ATG GTG 
ATG GTG ATG GTG 


PEAK12F 


GCC AGC TTG GCA CTT GAT GT 


PEAK12R 


GAT GGA GGT GGA CGT GTC AG 


21M13 


TGT AAA ACG ACG GCC AGT 


Ml 3 REV 


CAG GAA ACA GCT ATG ACC 


T7 primer 


TAA TAC GAC TCA CTA TAG GG 


SP6 primer 


ATT TAG GTG ACA CTA TAG 



Underlined sequence = Kozak sequence 
Bold = Stop codon 

10 Italic sequence = His tag 

3.4 Cloning of SCS0QQ9 by exon assembly 

SCS0009 is a prediction of 1663 nucleotides spanning 6 exons which encodes a 
protein of 352 amino acids with homology to preadipocyte factor -1 /delta- like protein 
15 (Figure 22). Various splice variants of the SCS0009 prediction have already been 
cloned. The version with closest sequence identity to the SCS0009 prediction, called 
SCS0009-SV5 (plasmid ID. 14846) differs from SCS0009 only by the presence of a 31 
amino acid insertion between exons 3 and 4. In order to generate SCS0009 protein: 

- Exons 1-3 and exons 4-5 of SCS0009 were amplified from plasmid ID. 14846 
20 (pCR4-TOPO-SCS0009-SV5) by PCR (Figure 23). 

- The gel -purified exons were mixed and a new PCR reaction was performed to 
amplify the re-assembled DNA. 

- The full length PCR product corresponding to the SCS0009 coding sequence 
(Figure 24) was subcloned into pCR-Bluntll-TOPO cloning vector (Invitrogen) 

25 and then sequentially into pDONR 201 (Gateway entry vector) and expression 

vectors pEAK12d and pDEST12.2. (expression vectors) using the Invitrogen 
Gateway™ methodology. 
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3.4.1 PGR amplification of exons encoding SCS0009 from plasmid ID. 14846 

fpCR4-TOPO-SCS0009-SV5V 
PGR primers were designed to amplify exons 1-3 and exons 4-6 of 
SCS0009 (Table 1). The reverse primer for exon 3 (SCS0009 -AP2) has an 
overlap of 18 bp with exon 4 of SCS0009 at its 5' end. The forward primer 
for exon 4 (SCS0009 -AP3) has an 18 bp overlap with exon 3 of SCS0009 
at its 5' end. 

To generate the product representing exons 1 -3 of SCS0009, the PCR 
reaction was performed in a final volume of 50 |il containing 100 ng of 
plasmid ID 14846, 1.5 of 10 mM dNTPs (Amersham Pharmacia Biotech), 
1 mJ of MgS0 4 (Invitrogen), 1.5 pi of SCS0009-AP1 (10 pM). 1.5 pi of 
SCS0009-AP2 (10 pM), 10 pi of 1 0X Pfx buffer and 0.5 pi of Pfx 
polymerase (2.5 U/pl) (Invitrogen). The PCR conditions were 94 °C for 5 
min; 25 cycles of 94 °C for 15 s, 68 °C for 1 min ; an additional elongation 
cycle of 68 °C for 7 min; and a holding cycle of 4 °C. Reaction products 
were loaded onto a 0.8 % agarose gel (1X TAE) and PCR products of the 
correct size (292 bp) were gel -purified using the Wizard PCR Preps DNA 
Purification System (Promega) and recovered in 50 pi sterile water 
according to the manufacturer's instructions. The product representing 
exons 4-5 of SCS0009 was produced and purified using the same method 
except PCR primers SCS0009-AP3 / SCS0009-AP4 were used. The 
SCS0009-AP3 / SCS0009-AP4 PCR product was 806 bp. 
3.4.2 Assembly of exons 1 -3 and exons 4-5 t o generate the SCS0009 ORF 
Exons 1-3 and 4-5 were assembled in a 50 PCR reaction containing 2 pi 
of gel purified exon 1 -3 product, 2 pJ of gel purified exon 4-5 product, 1.5 |xl 
of 10 mM dNTPs (Amersham Pharmacia Biotech), 1 pJ of MgS0 4 
(Invitrogen), 1.5 pi of SCS0009-AP1 (10 pM), 1.5 pi of SCS0009-AP4 (10 
pM), 10 pi of 1 0X Pfx buffer and 0.5 pi of Pfx polymerase (2.5 U/pl) 
(Invitrogen). The PCR conditions were 94 °C for 5 min; 25 cycles of 94 °C 
for 15 s, 68 °C for 1 min 30 sec; an additional elongation cycle of 68 °C for 7 
min; and a holding cycle of 4 °C. Reaction products were loaded onto a 0.8 
% agarose gel (1X TAE) and PCR products of the correct size (1062 bp) 
were gel-purified using the Wizard PCR Preps DNA Purification System 
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(Promega), recovered in 30 pi sterile water according to the manufacturer's 
instructions and stored at -20 °C until subcloning. 
3.4.3 Subcloning of PCR Products 

The PCR product was subcloned into the topoisomerase I modified cloning 
vector (pCR-Bluntll-TOPO) purchased from the Invitrogen Corporation using the 
conditions specified by the manufacturer. Briefly, 4 |xl of gel purified PCR 
product was incubated for 15 min at room temperature with 1 jil of TOPO vector 
and 1 mJ salt solution. The reaction mixture was then transformed into E. coli 
strain TOP10 (Invitrogen) as follows: a 50 nl aliquot of One Shot TOP10 cells 
was thawed on ice and 2 |xl of TOPO reaction was added. The mixture was 
incubated for 15 min on ice and then heat shocked by incubation at 42 °C for 
exactly 30 s. Samples were returned to ice and 250 of warm (room 
temperature) SOC media was added. Samples were incubated with shaking 
(220 rpm) for 1 h at 37 °C. The transformation mixture was then plated on L- 
broth (LB) plates containing kanamycin (40 jig/ml) and incubated overnight at 
37 °C. 

3.4.4 Colony PCR 

Colonies were inoculated into 50 ixl sterile water using a sterile toothpick. A 10 
\i\ aliquot of the inoculum was then subjected to PCR in a total reaction volume 
of 20 |il containing 1X AmpliTaq™ buffer, 200 dNTPs, 20 pmoles T7 primer, 
20 pmoles of SP6 primer, 1 unit of AmpliTaq™ (Perkin Elmer) using an MJ 
Research DNA Engine. The cycling conditions were as follows: 94 °C, 2 min; 30 
cycles of 94 °C, 30 sec, 48 °C, 30 sec and 72 °C for 1 min 30 sec. Samples 
were maintained at 4 °C (holding cycle) before further analysis. 
PCR reaction products were analyzed on 1 % agarose gels in 1 X TAE buffer. 
Colonies which gave the expected PCR product size (1062 bp cDNA + 186 bp 
due to the multiple cloning site or MCS) were grown up overnight at 37 °C in 5 
ml L-Broth (LB) containing kanamycin (40 \ig /ml), with shaking at 220 rpm. 
3.4.5 Plasmid DNA preparation and sequencing 

Miniprep plasmid DNA was prepared from 1.5 ml cultures using the 
FastPlasmid™ Mini Kit (Eppendorf) according to the manufacturer's 
instructions. Plasmid DNA was eluted in 50 \i\ of elution buffer (10mM Tris-CI, 
0.1 mM EDTA, pH 8.5). The DNA concentration was measured using an 
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Spectromax 190 photometer (Molecular Devices). Plasmid DNA (200-500 ng) 
was subjected to DNA sequencing with the T7 and SP6 primers using the 
BigDyeTerminator system (Applied Biosystems cat. no. 4390246) according to 
the manufacturer's instructions. The primer sequences are shown in Table 1. 
Sequencing reactions were purified using Dye -Ex columns (Qiagen) or Montage 
SEQ 96 cleanup plates (Millipore cat. no. LSKS09624) then analyzed on an 
Applied Biosystems 3700 sequencer. 

Sequence analysis identified a clone containing 100% match to the predicted 
SCS0009 sequence. The sequence of the cloned cDNA fragment is shown in 
Figure 24. The plasmid map of the cloned PCR product (pCR-Bluntll-TOPO- 
SCS0009) (plasmid ID. 14893) is shown in Figure 25. 
3.4.6 Construction of mammalian cell expression v ectors for SCS00009 
Plasmid 14893 was used as a PCR template to generate pEAK12d (figure 27) 
and pDEST12.2 (figure 28) expression clones containing the SCS0009 ORF 
sequence with a 3' sequence encoding a 6HIS tag using the Gateway™ cloning 
methodology (Invitrogen). 

3 4 7 Generation of Gateway compatible SCS0009 ORF fused to an in frame 
6HIS tag sequence. 

The first stage of the Gateway cloning process involves a two step PCR 
reaction which generates the ORF of SCS0009 flanked at the 5' end by an 
attB1 recombination site and Kozak sequence, and flanked at the 3 1 end by a 
sequence encoding an in -frame 6 histidine (6HIS) tag, a stop codon and the 
attB2 recombination site (Gateway compatible cDNA). The first PCR reaction 
(in a final volume of 50 contains: 1 pJ (40 ng) of plasmid 14893, 1 .5 \i\ dNTPs 
(10 mM), 10 |JLl of 10X Pfx polymerase buffer, 1 \i\ MgS04 (50 mM), 0.5 |xl each 
of gene specific primer (100 jllM) (SCS0009-EX1 and SCS0009-EX2), and 0.5 
Hi Platinum Pfx DNA polymerase (Invitrogen). The PCR reaction was performed 
using an initial denaturing step of 95 °C for 2 min, followed by 12 cycles of 94 
°C for 15 s; 55 °C for 30 s and 68 °C for 2 min; and a holding cycle of 4 °C. The 
amplification products were visualized on 0.8 % agarose gel in 1 X TAE buffer 
(Invitrogen) and a product migrating at the predicted molecular mass was 
purified from the gel using the Wizard PCR Preps DNA Purification System 
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(Promega) and recovered in 50 \x\ sterile water according to the manufacturers 
instructions. 

The second PCR reaction (in a final volume of 50 contained 10 \i\ purified 
PCR 1 product, 1.5 pJ dNTPs (10 mM), 5 *il of 10X Pfx polymerase buffer, 1 ^ 

> MgS04 (50 mM), 0.5 \i\ of each Gateway conversion primer (100 (GCP 

forward and GCP reverse) and 0.5 yd of Platinum Pfx DNA polymerase. The 
conditions for the 2nd PCR reaction were: 95 °C for 1 min; 4 cycles of 94 °C, 15 
sec; 50 °C, 30 sec and 68 °C for 2 min; 25 cycles of 94 °C, 15 sec; 55 °C , 30 
sec and 68 °C, 2 min; followed by a holding cycle of 4 °C. PCR products were 

0 gel purified using the Wizard PCR prep DNA purification system (Promega) 

according to the manufacturer's instructions. 

3.4.8 Subclonina of Gateway compatible SCS0009 O RF into Gateway entry 
vector pDONR221 and expression vectors pEA K12d and PDEST12.2 
The second stage of the Gateway cloning process involves subcloning of the 
5 Gateway modified PCR product into the Gateway entry vector pDONR221 

(Invitrogen, figure 26) as follows: 5 yd of purified product from PCR2 were 
incubated with 1 .5 *il pDONR221 vector (0.1 ngW), 2 ^l BP buffer and 1 .5 fxl of 
BP clonase enzyme mix (Invitrogen) in a final volume of 10 |xl at RT for 1 h. The 
reaction was stopped by addition of proteinase K 1 |xl (2 pg/|xl) and incubated at 
z0 37 °c for a further 10 min. An aliquot of this reaction (1 nl) was used to 

transform £. coli DH10B cells by electroporation as follows: a 25 \i\ aliquot of 
DH10B electrocompetent cells (Invitrogen) was thawed on ice and 1 [i\ of the 
BP reaction mix was added. The mixture was transferred to a chilled 0.1 cm 
electroporation cuvette and the cells electroporated using a BioRad Gene- 
25 Pulser™ according to the manufacturer's recommended protocol. SOC media 

(0.5 ml), which had been pre-warmed to room temperature, was added 
immediately after electroporation. The mixture was transferred to a 15 ml snap- 
cap tube and incubated, with shaking (220 rpm) for 1 h at 37 °C. Aliquots of the 
transformation mixture (10 [il and 50 jxl) were then plated on L-broth (LB) plates 
30 containing kanamycin (40 ng/ml) and incubated overnight at 37 °C. 

Plasmid mini-prep DNA was prepared from 5 ml cultures from 6 of the resultant 
colonies using a Qiaprep Turbo 9600 robotic system (Qiagen). Plasmid DNA 
(150-200 ng) was subjected to DNA sequencing with 21M13, M13Rev and 
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SCS0009-SP1 primers using the BigDyeTerminator system (Applied 
Biosystems cat. no. 4390246) according to the manufacturer's instructions. The 
primer sequences are shown in Table 1. Sequencing reactions were purified 
using Dye-Ex columns (Qiagen) or Montage SEQ 96 cleanup plates (Millipore 
cat. no. LSKS09624) then analyzed on an Applied Biosystems 3700 sequencer. 
Plasmid eluate (2 pi or approx. 150 ng) from one of the clones which contained 
the correct sequence (pENTR-SCS0009-6HIS, plasmid ID 15057, figure 29) 
was then used in a recombination reaction containing 1.5 |xl of either pEAK12d 
vector or pDEST12.2 vector (figures 27 & 28) (0.1 pg / pi), 2 jll! LR buffer and 
1.5 |il of LR clonase (Invitrogen) in a final volume of 10 The mixture was 
incubated at RT for 1 h, stopped by addition of proteinase K (2 pg) and 
incubated at 37 °C for a further 10 min. An aliquot of this reaction (1 ul) was 
used to transform E. coti DH10B cells by electroporation as follows: a 25 *il 
aliquot of DH10B electrocompetent cells (Invitrogen) was thawed on ice and 1 
\i\ of the LR reaction mix was added. The mixture was transferred to a chilled 
0.1 cm electroporation cuvette and the cells electroporated using a BioRad 
Gene-Pulser™ according to the manufacturer's recommended protocol. SOC 
media (0.5 ml), which had been pre-warmed to room temperature, was added 
immediately after electroporation. The mixture was transferred to a 15 ml snap- 
cap tube and incubated, with shaking (220 rpm) for 1 h at 37 °C. Aliquots of the 
transformation mixture (10 jll! and 50 ijl!) were then plated on L-broth (LB) plates 
containing ampicillin (100 |ig/ml) and incubated overnight at 37 °C. 
Plasmid mini-prep DNA was prepared from 5 ml cultures from 6 of the resultant 
colonies subcloned in each vector using a Qiaprep Turbo 9600 robotic system 
(Qiagen). Plasmid DNA (200-500 ng) in the pEAK12d vector was subjected to 
DNA sequencing with pEAK12F, pEAK12R and SCS0009-SP1 primers as 
described above. Plasmid DNA (200-500 ng) in the pDEST12.2 vector was 
subjected to DNA sequencing with 21M13, M13Rev and SCS0009-SP1 primers 
as described above. Primer sequences are shown in Table 1. 
CsCl gradient purified maxi-prep DNA was prepared from a 500 ml culture of 
one of each of the sequence verified clones (pEAK12d-SCS0009-6HIS, plasmid 
ID number 15062, figure 30, and pDEST12.2-SCS0009-6HIS, plasmid ID 
15064, figure 31) using the method described by Sambrook J. et a!., 1989 (in 
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Molecular Cloning, a Laboratory Manual, 2 nd edition, Cold Spring Harbor 
Laboratory Press). Plasmid DNA was resuspended at a concentration of 1 \ig/\i\ 
in sterile water (or 10 mM Tris-HCI pH 8.5) and stored at -20 °C. 

5 Table V SCS0009 cloning and sequencing primers 



Primer 




SCS0009- 
API 


afir aTr rrr af^r nnr TGC CGC TGC CTG CAT CTC G 

V^V_«V-» iiOL Xw»w vvVi v^j-^j w***. «* v ' 


SCS0009- 
AP2 


AGT CAC GCC CAT GGA AGC CTT TGT CAC AGA ACT TGC 


SCS0009- 
AP3 


GCA AGT TCT GTG ACA AAG GCT TCC ATG GGC GTG ACT GC 


SCS0009- 
AP4 


TCA CAG TGC TGT GGT CTT TCC AGG CTC AGG GGG CAA GTC 


SCS0009- 
EX1 


AA GCA GGC TTC GCC ACC ATG CCC AGC GGC TGC CGC TG 


SCS0009- 
EX2 


GTG ATG GTG ATG GTG CAG TGC TGT GGT CTT TCC AG 


GCP 

Forward 


q r.r.r. ara AfiT TTR TAC AAA AAA GCA GGC TTC GCC ACC 


GCP 

Reverse 


GGG GAC CAC TTT GTA CAA GAA AGC TGG GTT TCA ATG GTG 
ATG GTG ATG GTG 


SCS0009- 
SP1 


TGA TGC GGC CTT GTG CTA AC 


PEAK12F 


GCC AGC TTG GCA CTT GAT GT 


PEAK12R 


GAT GGA GGT GGA CGT GTC AG 


21M13 


TGT AAA ACG ACG GCC AGT 


M13REV 


CAG GAA ACA GCT ATG ACC 


T7 primer 


TAA TAC GAC TCA CTA TAG GG 


SP6 primer | ATT TAG GTG ACA CTA TAG 



Sequence in red = overlap with adjacent exon. 
Underlined sequence = Kozak sequence 
10 Bold = Stop codon 

Italic sequence = His tag 

Example 4: Identification and description of SCS0009 domains 

4.1 identification 

15 A bioinformatic tool called SMART ihtto://smart. embl -heidelberg.de/) was used 

to identify the putative domains of SCS0009 and of the splice variant SCS0009-SV3. 
Results are shown in Figure 2. In addition, Prosite was also run on the sequences 
(frtto://us.expasv.ora/prosite/) . 
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Table VI Domains within the query sequ ence SCS0009 of 352 residues 

Confidently predicted domains, repeats, motifs and features: 
V I Jtaw&jT*: 'iSSfffi. I™*! ' E-value '] 
si gnal 'peptide ; 1 r 422 "., 



28 58 3 : 16e+01 

v EGF - 59 : 89 7.76e-03 

. ; EGF CA 93: 1:41 3.51e-02 

EGF- CA . . 143 ;179 : 6.64e-1 1 : 

- EGF CA 181 217 9.47e-Q7 

Inw complexity 218 233 

transmembrane 276 298. 

low complexity .306 322 - 

Table VII Domains within the query sequence SCS0009-SV3 of 377 residues 

Confidently predicted domains, repeats, motifs and features: 
; I n ^ e ;lT jlbegU? end ; E-value 
signal peptide 1 22 

EGF 28 58, 3.1.6e+01 

EGF 59 90 5.04e-02 

EGF CA 89 123 4.41 e-02 
EGF CA 129 166 3.32e-02 
EGF CA 168 204 6.64e-11 
EGF CA 206 242 9.47e-07 
low complexity 243 258 
transmembrane 301 323. 
low complexity 331 347 - 

Table VIII Domains within the query sequence SCS0009-SV4 of 134 residues 

Confidently predicted domains, repeats, motifs and features: 
j. " name. ; P^ffi end E-yalue 
signal peptide 1 22 - 
EGF 28. 58 3.16e+01 

EGF 59 89 7.76e-03 

EGF 94 129 1.23e-05 
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Partfal ScanProsite results: 
• SCS0009 

> PDOC00016 PS00016 RGD Cell attachment sequence [pattern] [Warning: pattern 
with a high probability of occurrence]. 
382 -384 RGD 

4.2 Description of the domains 

• EGF. Epidermal growth factor-like domain. Interpro annotation: 

A sequence of about thirty to forty amino-acid residues long found in the 
sequence of epidermal growth factor (EGF) has been shown MEDLINE: . 
MEDLINE:881 96363 . MEDLINE:841 1 7505 . MEDLINE:91 145344 . 

MEDLINE:85063790 . MEDLINE: to be present, in a more or less conserved 
form, in a large number of other, mostly animal proteins. The list of proteins 
currently known to contain one or more copies of an EGF -like pattern is large 
and varied. The functional significance of EGF domains in what appear to be 
unrelated proteins is not yet clear. However, a common feature is that these 
repeats are found in the extracellular domain of membrane -bound proteins or 
in proteins known to be secreted (exception: prostaglandin G/H synthase). 
The EGF domain includes six cysteine residues which have been shown (in 
EGF) to be involved in disulphide bonds. The main structure is a two -stranded 
P -sheet followed by a loop to a C-terminal short two-stranded sheet. 
Subdomains between the conserved cysteines vary in length. 

• GF_CA. Calcium-binding EGF-like domain. Interpro annotation: 

A sequence of about forty amino-acid residues long found in the sequence of 
epidermal growth factor (EGF) has been shown to be present in a large 
number of membrane-bound and extracellular, mostly animal protein s (see 
IPR000561 ) . Many of these proteins require calcium for their biological 
function and a calcium-binding site has been found to be located at the N- 
terminus of some EGF-like domains. Calcium-binding may be crucial for 
numerous protein -protein interactions. For human coagulation factor IX it has 
been shown that the calcium-ligands form a pentagonal bipyramid. The first, 
third and fourth conserved negatively charged or polar residues are side chain 
ligands. Latter is possibly hydroxylated (see IPR000152) . A conserved 
aromatic residue as well as the second conserved negative residue are 
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thought to be involved in stabilizing the calcium -binding site. Like in non- 
calcium binding EGF-like domains there are six conserved cysteines and the 
structure of both types is very similar as calcium -binding induces only strictly 
local structural changes. 

+ + + + 

I 111 

nxnnC -x (3, 14) -C-x (3,7) -CxxbxxxxaxC-x (1, 6) -C-x <8, 13) -Cx 

I I 

+ + 

•n': negatively charged or polar residue [DEQN] 
•b 1 : possibly beta -hydroxylated residue [DN] 

• a * : aromatic amino acid 

■C: cysteine, involved in disulfide bond 
'x 1 : any amino acid. 

• PS00016: RGP 

The sequence Arg-Gly-Asp, found in fibronectin, is crucial for its interaction 
with its cell surface receptor, an integrin. What has been called the 'RGD' 
tripeptide is also found in the sequences of a number of other proteins, where 
it has been shown to play a role in cell adhesion. These proteins are: some 
forms of collagens, fibrinogen, vitronectin, von Willebrand factor (VWF), snake 
disintegrins, and slime mold discoidins. The 'RGD' tripeptide is also found in 
other proteins where it may also, but not always, serve the same purpose. 
Conclusion 

In addition to the comments put forward in example 1, it is clear, based on 
SMART results (figure 32), that SCS0009, SCS0009-SV3 and SCS0009-SV5 
have a common domain organization. However, SCS0009, but not SCS0009- 
SV3 nor SCS0009-SV5, contains an RGD tripeptide sequence, indicating that 
SCS0009 may interact with integrin ceil surface receptors, which are involved in 
cell adhesion. SCS0009 and SCS0009-SV3 contain also a dileucine, an ER 
membrane retention signal (KKXX-like motif in the C-terminus: KTTA), as well 
as a probable zinc finger of C3HC4 type (identified by PSORT: 
httD://DSort,nibb.ac.iD/form2.html , results not shown). As such, SCS0009 shows 
distinct features in its signaling pathway compared with SCS0009-SV3 and 
SCS0009-SV5. Among the splice variants described, SCS0009-SV4 does not 
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contain a transmembrane domain. As such, it could represent a secreted 
protein, which may act as integral SCS0009 antagonist in vivo. In addition, 
when exposed to the action of an unknown protease, DLK1 generates a soluble 
peptide referred to as Fetal antigen 1 (FA1) (Jensen et al.). It is therefore 
probable that soluble SCS0009 and SCS0009-SV3 (as well as for other splice 
variants of SCS0009) can be generated through the action of FA1. These 
soluble SCS0009 polypeptides may also act as integral SCS0009 antagonist in 
vivo and might show particularly useful in the diagnosis and treatment of 
diseases, as described in therapeutic uses above. 



Example 5: Metabolic Endocrinology Assays Suitable for Exploration of the 
Biological Relevance of proteins Function 

A number of metabolic endocrinology-related assays have been developed by the 
Applicant and are of use in the investigation of the biological relevance of protein 
15 function. Examples of metabolic endocrinology -related assays that have been 
developed by the Applicant include four cell -based assays for metabolic endocrinology. 
These are discussed below. 

5.1 Differentiation to adipoc yte assay: 

Inhibition of adipocyte differentiation is an in vitro model for reduction of adipose 
20 mass believed to be important in reducing insulin resistance in d iseases such 

as diabetes and Polycystic Ovary Syndrome (PCOS). The goal is to identify 
protein(s) that inhibit differentiation of pre -adipocytes to adipocytes. The 3T3-L1 
mouse preadipocyte cell line is induced to differentiate to adipocytes with insulin 
+ IBMX. The finding that differentiation is inhibited by TNFct + cyclohexamide is 
25 used as a positive control. 

5.2 Tritiated glucose uptake (3T3 LP: 

The goal is to identify protein(s) that stimulate glucose uptake as a model for 
insulin-resistance in adipose during diabetes or PCOS. Adipocytes used are 
mouse 3T3-L1 preadipocytes that have been differentiated. 

30 5.3 Tritiated glucose uptake (primary human adip ocytes): 
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The goal is to identify protein(s) that stimulate glucose uptake as a model for 
insulin-resistance in adipose during diabetes or PCOS. Primary human 
adipocytes are used. 
5.4 Tritiated glucose update ( p rimary human skeletal muscle cells): 

The goal is to identify protein(s) that stimulate glucose uptake as a model for 
insulin-resistance in muscle tissue during diabetes or PCOS. Primary human 
skeletal muscle cells are differentiated into myotubes and then used in the 
assay. 
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